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THE PRESTIGE OF INDUSTRIES 


ROBERT E. 


CAMPBELL ! 


Ohio State University 


Since the original study by Counts in 1925, 
there have been numerous occupational pres- 
tige investigations. Probably the most am- 
bitious of these was conducted by North and 
Hatt in 1947. Despite the large number of oc- 
cupational studies, the related area of indus- 
tries has been neglected. To the best of the 
writer’s knowledge, the only study of the pres- 
tige hierarchy of industries was conducted by 
Brayfield, Kennedy, and Kendall in 1953. The 
present study was designed to add to our 
knowledge of industrial prestige and to pro- 
vide a frame of reference for future research 
in this area. 

To accomplish this aim, the investigation 
was concerned with four major questions as 
follows: 

1. What is the nature of the hierarchy of 
industries according to prestige? 

2. Do personal attributes of the respondent 
such as academic training, father’s industry, 
sex, work experience, military experience, and 
the industry he hopes to enter influence his 
evaluation? 

3. What are the major determinants of the 
prestige of an industry? 

4. Do respondents from two different geo- 
graphical regions of the United States agree 
in rating industries? 


METHOD 


Fifty-two industries were selected representing each 
of the 10 major divisions of the Standard Industrial 
Classification System, 1957. The 10 divisions include 
the following: 


agriculture, forestry, and fisheries 
mining 
contract construction 


1The author wishes to thank his advisor, C. L. 
Shartle, for his help with the preparation of this 
study. 


manufacturing 

transportation, communication, electric, gas and 
sanitary services 

wholesale and retail trade 

finance, insurance, and real estate 

services 

government 

nonclassifiable establishments 


The following criteria were used to select the 52 
industries from the 1500 classified: 

(a) Twenty-two industries were selected from the 
earlier study of Brayfield et al. to determine the de- 
gree of agreement between the studies. See the starred 
industries in Table 3. 

(b) Industries that were thought to be well known 
by the respondents. 

(c) Similar industries were chosen to discover the 
rank difference between them. For example, local 
bus transportation and interstate bus transportation; 
public schools and private schools; motels and hotels. 

(d) Industries which reflected our era. This group 
included weapons manufacturing, atomic energy 
plants, labor organizations, and research laboratories. 

(e) The major military services. 

The data were obtained by questionnaire. In addi- 
tion to rating the 52 industries, the respondent was 
requested to answer questions about himself (age, 
sex, college, major, family income, etc.). The rating 
categories were the same as used by North and Hatt 
in their study of occupations. See Table 1 for the 
rating scheme. By using this scheme, the theoretical 
high that an industry could obtain was 100 and the 
low was 20. 

The respondents were cautioned to express their 
own opinion and not to be influenced by one person 
in a particular industry. 

To eliminate communication ambiguity and to de- 
tect flaws in the instrument, a pilot study was con- 
ducted. Form A was administered to 52 Ohio State 
University engineering students. The form was also 
reviewed by a total of 20 students and faculty mem- 
bers for improvement. In the actual study, three dif- 
ferent forms were used. The forms differed only in 
the arrangement of the industries. Form A listed the 
industries in the order designated by the Standard 
Industrial Classification Manual (1957); Form B 
had the industries arranged by alphabetical order; 
and Form C was arranged by random assignment. 
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TABLE 1 


RATING CATEGORIES 


Score 
Rating Value Statement 


100 This industry has excellent prestige 


1 

2 80 This industry has good prestige 

3 60 This industry has average prestige 

+ 40 This industry has fair prestige 

5 20 This industry has poor prestige 

x 0 I don’t know how to rate this industry 


| 


The industries were rated by 360 junior and senior 
students at the Ohio State University. The students 
were randomly selected from the College of Arts and 
Sciences and the College of Commerce and Adminis- 
tration. The Commerce sample was limited to busi- 
ness-oriented students to provide an academic con- 
trast with the Arts and Science students. The students 
represented more than twenty different acedemic ma- 
jors (banking, journalism, advertising, fine arts, pre- 
medical, accounting, etc.). 


TABULATION PROCEDURE 


A rank order of the industries was estab- 
lished for each group of respondents based on 
the mean scores. Thus, the mean scores for 
the men represented one hierarchy of indus- 
tries. Rank orders were also computed for 
women, respondents with work experience, 
those with no work experience, Commerce 
students, and Arts and Science students. 
Rank-order correlations were computed for 
each group comparison (men vs. women, 


Commerce vs. Arts and Sciences, etc.). 

To determine whether the industry of the 
father influenced the respondent’s ratings, the 
respondents were grouped by their father’s 
industry. The respondent was placed in one 


TABLE 2 


SAMPLING SCHEDULE 


College of Arts 


College of 
Commerce and Sciences 
Form Men Women Men Women Sums 
A 40 20 40 20 120 
B 40 20 40 20 120 
he 40 20 40 20 120 
120 60 120 00 360 
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of four industrial groups (manufacturing, 
trade, service or government) depending on 
the father’s industry. The four groups of re- 
spondents were compared for their ratings of 
manufacturing, trade, service and government 
industries. The percentage of ratings above or 
below the industry’s composite mean for each 
group was computed. A chi square test was 
used to measure the group differences. 

The students were also compared to ascer- 
tain whether the industry they hope to enter 
influenced their ratings. The procedure was 
the same as above with father’s industry. 

Army, Navy, and Air Force veterans as 
well as nonveterans were compared to dis- 
cover their agreement on rating the Army, 
Navy, and Air Force. Other services were not 
used since the number of veterans in the sam- 
ple was inadequate. A chi square test was em- 
ployed to compute the differences. 

To evaluate the influence of occupations on 
the ‘prestige of the industries, representative 
occupations from the North-Hatt scale were 
selected. Twenty-two of the industries had 
occupational correlates on the scale. Since a 
number of the industries had more than one 
related occupation, the North-Hatt ranks 
were averaged to derive a mean rank. The 
22 industries and occupations were reranked 
and a rank-order correlation was computed. 

The earlier industrial prestige study con- 
ducted by Brayfield et al. was compared with 
this study. A rank-order correlation was com- 
puted between the 22 industries common to 
both studies. Since the Brayfield study did 
not report a combined rank order for both 
men and women, correlations were computed 
for each sex. 


RESULTS AND DISCUSSION 


The results are presented in Tables 3, 4, 
and 5. Table 3 presents the composite rank- 
ings based on the mean ratings for each in- 
dustry. The weighted ratings ranged from a 
high of 92.2 for Medical Services to a low of 
30.1 for Coal Mining. Although the hierarchy 
shows good dispersion, it should be noted that 
the score distance between several of the in- 
dustries was small. For example, Drug Prod- 
ucts Manufacturing and Religious Organiza- 
tions (difference of 0.8); Real Estate Com- 
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TABLE 3 
RANK OrpeErR or 52 INpusTRIES BASED ON MEAN SCORES OF PRESTIGE RATINGS By 360 COLLEGE STUDENTS 


Mean 


Industry Score 


®Medical services 92.2 
Colleges and universities 
Research laboratories 
Atomic energy plants 
*Banking 
Private schools (secondary) 
*Commercial airlines 
Television broadcasting 
*Chemical products mfg. 
*Aircraft mfg. 
*Motion picture industry 
*Federal government 
*Drug products mfg. 
Religious organizations 
*Publishing companies 
Insurance companies K 
Petroleum refining 69.4 
*Real estate companies 66.7 
*Automobile mfg. 66.6 
Air Force 66.4 
*Telephone companies 64.8 
Public schools (secondary) 64.7 
*Hotels 64.4 
Navy 64.4 
Ship and boat building 63.8 
*Electric power and light 
companies 


~ 
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63.7 


® Denotes industries used by Brayfield et al. 
Note.— Mean rating: 63.6. Standard deviation: 13.56. 


panies, Automobile Manufacturing and Air 
Force (0.3); and Weapons Manufacturing, 
Rubber Products Manufacturing, and Local 
Government (0.3). In light of this it would 
be inadvisable to adhere rigidly to the mean 
ranks. as sharp discrimination points on the 
scale. Further, it can be seen that the dis- 
tance between the industries is neither equal 
nor large. Consequently, it might be more 
meaningful to group the industries on the ba- 
sis of the rating adjectives and mean score 
range (see Table 4). 

Table 5 summarizes the relationships. Most 
of the trait comparisons resulted in high cor- 
relations, which suggests that such influences 
as sex of the respondent, prior work experi- 
ence, academic training, and the presentation 
order of the industries produce only small 
differences in the ratings. 


Mean 


Industry Score 


63.0 
61.8 
61.7 


Marines 

Weapons mfg. 

Rubber products mfg. 
*Local government 
Department stores 
Ceramic products mfg. 
Motels 
*Furniture mfg. 

Clothing mfg. 
Restaurants 

Tobacco mfg. 

Coast guard 
“Building construction 
*Drug stores 

Forestry 

Iron and steel mills 
*Railroads 

Interstate bus transportation 
Army 

“Farming 

Grocery stores 
Meatpacking 

Labor organizations 
*Laundries and dry cleaning 
Local bus transportation 
*Coal mining 


The moderate agreement between Bray- 
field’s Kansas respondents and the Ohio re- 
spondents in this study implies that geo- 
graphic differences play an important role in 


TABLE 4 


INDUSTRIES GROUPED BY RATING 
ADJECTIVE AND SCORE 


Score 


Rating Category Range Industries 


90 and up 
80 to 90 
70 to 80 
60 to 70 
50 to 60 
40 to 50 
below 40 


Excellent prestige 


Good prestige 
Average plus prestige 
Average prestige 
Fair plus prestige 
Fair prestige 

Poor prestige 
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TABLE 5 


SUMMARY OF RELATIONS'TIPS 


Rho 

Correlation 

Variables Coefficient 
Men and women +.95 
Prior work/no prior work +97 
Brayfield/Campbell (men)* +.47 
Brayfield/Campbell (women) +.55 
Commerce/Arts and Sciences +.96 
Within college (Commerce) +.96 
Within college (Artsand Sciences) +.97 
Occupations and industries +-.84 


Agreement among forms +.94 (Kendall's W ) 


* Significant at .05 level of confidence; all others significant 


at the .001 level. 


determining the relative prestige of indus- 
tries. The industries that showed the most 
disagreement for both men and women were 
Farming, Local Government, General Build- 
ing Construction, Chemical Manufacturing, 
Wholesale Drug Companies, and Hotels. The 
largest rank difference occurred for Farming. 
Brayfield’s respondents ranked Farming, I,o- 
cal Government, and General Building Con- 
struction higher. 

Respondents grouped by their father’s in- 
dustry did not show rating partiality for his 
industry, nor did the respondents grouped by 
the industry they hope to enter. 

The three groups of military veterans and 
the group of nonveterans differed significantly 
for rating two of the three military services 
(Army and Air Force). The Army veterans 
showed the most disagreement and had the 
greatest percentage of ratings below the com- 
posite mean for all of the services. The Air 
Force and Navy veterans, as well as the 
nonveterans, rated the three services approxi- 
mately the same and expressed a slight favor- 
ing for the Air Force. 

Forty-two percent of the respondents se- 
lected “it requires intelligence and ability to 
enter the industry” as the influence that con- 
tributes most to the prestige of an industry. 
A large proportion also selected “social stand- 
ing” (21.7%) and 21% chose “it benefits hu- 
manity” as the primary determiner of the 
prestige of an industry. The remainder stated 
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that industrial prestige cannot be attributed 
to any one influence, but that it is due to a 
combination of influences such as wealth of 
the industry, benefit to society, intelligence 
and ability required to enter, and general so- 
cial standing. The writer feels that prestige is 
due to a combination of factors and that it is 
almost impossible to isolate one factor to ac- 
count for prestige. This is supported by the 
composite hierarchy which (Table 3) fails to 
reveal any high correlation with one factor. 
The comparison between the prestige rank- 
ings of occupations of an industry selected 
from the North-Hatt study and the ranks of 
comparable industries in this study indicated 
a strong agreement (+.84). The high corre- 
lation suggests that the respondents were 
probably influenced by the typical occupa- 
tions of an industry. For example, medical 
services, physician; railroads, conductor; coal 
mining, coal miner; and banking, banker. 
Some of the respondents commented that they 
felt they were influenced by the occupations. 


CONCLUSIONS 


The hierarchy of industries in terms of 
prestige ratings is not a sharply defined rank 
by rank continuum. It can better be described 
as a hierarchy progressing in clusters. Each 
cluster is composed of industries that are 
judged to be relatively similar in prestige 
status. 

Respondents compared on the basis of their 
personal attributes, namely, academic training, 
sex, father’s industry, the industry they hope 
to enter, and work experience, were homo- 
geneous in evaluating industries for prestige. 
The only group comparison that did not re- 
sult in high agreement was military experi- 
ence. The Army veterans differed from the 
other veterans and nonveterans and consist- 
ently rated the services below the composite 
mean. 

There are a number of contributing influ- 
ences that determine the prestige of an in- 
dustry. The influences primarily include the 
typical occupations of the industry, the bene- 
fit it gives to society, the intelligence and 
ability required of the employees, and gen- 
eral social standing. 

The respondents from two different geo- 
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graphical areas in the United States showed Bureau or tHe Bupcet, Division or STATISTICAL 


moderate agreement. It appears that the re- STANDARDS. Standard Industrial Classification 
Manual. Washington: U. S. Government Printing 
spondents are influenced by the industrial Office, 1957 
values of their respective areas in evaluating Counrs, G. S. The social status of occupations; a 
industries for prestige. problem in vocational guidance. School Rev., 1925, 
33, 16-27. 
REFERENCES Nortn, C. C., & Hartt, P. K. Occupations ranked 
Brayriecp, A. H., Kennepy, C. E., Jr., & 
W. E. Social status of industries. J. appl. Psychol., Aepiipis ’ 
1954, 38, 213-215. (Received May 18, 1959) 
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SOME CORRELATES OF THE EMPATHY OF THE 
WORKERS WITH THE FOREMAN'* 


URIEL G. FOA 
Israel Institute of Applied Social Research 


It has been suggested that the performance 
of a group such as a working team is influ- 
enced by the extent to which the group mem- 
bers succeed in judging correctly the percep- 
tions and expectations of the other members 
(McMurry, 1953). For this reason research 
findings about the factors which influence 
empathy are of immediate interest to people 
dealing with practical problems of group or- 
ganization. 

This paper is concerned with the empathy 
of the worker toward his foreman when one’s 
perception differs from the other’s. It reports 
on an attempt to predict empathic success or 
failure of the worker from two kinds of fac- 
tors: source of motivation and the character- 
istics of the judged perception. Three motiva- 
tions are considered: the motivation of the 
worker, the motivation that the worker as- 
cribes to the foreman, and the motivation of 
the foreman. A foreman, for example, may be 
motivated to seek reward from management, 
thus orienting his attention toward the cues 
coming from management. For another fore- 
man the rewards coming from the worker are 
more important, and he will therefore orient 
his attention toward the worker. The assump- 
tion is that-one cannot pay attention to all 
the cues to which one is exposed in an inter- 
personal situation and that he will orient his 
attention toward the source which is more 
meaningful to him in terms of potential re- 
wards (Walters, 1958). It is suggested that 


1 This paper is an outgrowth of a project on hu- 
man factors in production initiated with partial sup- 
port from the Ford Foundation. The research re- 
ported here has been sponsored by the Air Force 
Office of Scientific Research of the Air Research and 
Development Command, United States Air Force, 
through its European Office, under Contract AF 
61(052)-121. Reproduction in whole or in part of 
this paper is permitted for any purpose of the United 
States Government. 

Thanks are expressc! to Judith T. Shuval for valu- 
able comments on a preliminary draft of the manu- 
script. 


the empathic response will be influenced by 
these differences in orientation. The results of 
a laboratory experiment described by Lundy 
(1956) show that the ability to judge the 
perception of another person improves when 
the attention of the empathizer is oriented to- 
ward this person. 

Another possible factor related to success 
of empathy is the characteristics of the pre- 
dicted perception. Baker and Block (1957) 
have found that, in a laboratory experiment, 
empathic success is more likely when the be- 
havioral description to be predicted possesses 
socially acceptable traits. 


METHOD : 
The Test of Foreman-Worker Relationship (Foa, 
1957) was administered to 361 Israeli factory work- 
ers and their 51 foremen. This test consists of two 
sets of pictures of eight situations each. One set is 
concerned with the behavior of the foreman, and the 
other with the behavior of the worker. Both sets of 
situations were presented to the foreman and the 
worker. For each situation they were instructed to 
choose, out of four given types of behavior, the one 
which (a) ought to occur in an ideal situation, and 
(b) the one which occurs most often in practice in 
their department. Further, each worker was re- 
quested to guess the alternatives chosen by his own 
foreman, both at the ideal and the actual levels. 

Scoring. (1) Real Similarity Score: number of 
times the worker and his foreman gave the same re- 
sponse to the eight situations of a given set of pic- 
tures. (2) Empathy Score: number of times the 
worker succeeded in judging correctly the responses 
given by the foreman to the pictures of a given set. 

Since there are eight pictures in each set, it is evi- 
dent that the maximum score is eight and the mini- 
mum score is zero. 

For each worker four empathy scores were ob- 
tained, referring to four types of action, namely, the 
ideal behavior of the foreman (IF), the ideal behav- 
ior of the worker (IW), the actual behavior of the 
foreman (AF), and the actual behavior of the worker 
(AW). Likewise four real similarity scores, one for 
each type of action, were obtained for each worker- 
foreman dyad. The present analysis is concerned with 
workers having a real similarity score lower than 4, 
for a given type of action. The perception of these 
workers differs, in most cases, from the perception of 
their foremen. 
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Correlates of Worker—Foreman Empathy 


Within this group of workers, two subgroups are 
compared: (a) Workers who, in most cases, fail to 
empathize, (empathy score below 4). (b) Workers 
who, in most cases, succeed in empathizing (empathy 
score above 4). The empathic outcome thus defined 
constitutes the predicted variable. 

One of the predictors, motivation, was derived as 
following: Each respondent was asked to indicate, 
through the choice of appropriate pictures of the 
test, the source of the motivation of his behavior 
and of the behavior of the other. One of these 
sources is the workers’ group. When this group was 
chosen at least twice out of four possible times, the 
behavior was said'to be worker-oriented. Otherwise, 
it was said to be non-worker-oriented. 

The second predictor refers to the characteristics 
of the perception of the foreman. Real similarity and 
empathy scores were obtained by comparing this 
perception, respectively, with the perception of the 
worker and the perception that the worker ascribes 
to the foreman. The scores indicate whether the com- 
pared perceptions are alike or different 

The design of the test enables the investigator to 
determine which facets of behavior are different in 
the two compared responses. Three facets are con- 
sidered: (a) focus: behavior can be either instru- 
mental (focused on the job) or affective (focused 
on the personal relationship); (6) activity: behav- 
ior can be active (doing something) or passive (re- 
fraining from doing something); (c) direction: be- 
havior can be positive, directed toward the focus 


7 


(fostering the group goal or strengthening the per- 
sonal relationship), or negative, away from the focus 
(Foa: 1955, 1958b). Aggressive behavior, for exam- 
ple, is focused on the personal relationship, active 
and negative. It was therefore possible to observe 
whether the perception of the foreman, the percep- 
tion of the worker, and the perception ascribed by 
the worker to the foreman were alike or different in 
the value of a particular facet. Further, it was pos- 
sible to find out whether the success of the worker 
in judging the perception of the foreman was re- 
lated to the values of the facets of such perception. 


RESULTS 


The finding reported in Table 1 concerns 
the relationship between motivation and the 
worker’s success or failure in empathizing the 
foreman. 

Motivation of the Worker. When the worker 
is oriented toward his peer group he is some- 
what less likely to succeed in empathizing the 
foreman. This supports the findings of Lundy 
(1956), but in our case, the percentage dif- 
ference is small and not significant statisti- 
cally. 

Motivation of the Foreman. When the fore- 
man is oriented toward the workers, the 


TABLE 1 


PERCENTAGE OF WORKERS SUCCEEDING IN EMPATHIZING BY TYPE OF ACTION AND MOTIVATION 


Type of Action 


Ideal 


All 


Motivation of Foreman 


of Worker 


of Foreman of Worker 


Types 


of worker 


worker-oriented 
not worker-oriented 


11 (18) 
13 (24) 


18 (28) 
21 (19) 


Difference 2 3 


of foreman 
worker-oriented 
not worker-oriented 


33 (15) 
12 (32) 


20 (10) 
9 (32) 
Difference 11 


21 


ascribed to foreman 


worker-oriented 
not worker or oriented 


0 (19) 
22 (23) 


16 (32) 
27 (15) 


22 


Difference 11 


* Difference significant at the .01 level. 


19 (43) 
20 (40) 


12 (147) 
15 (121) 


1 3 


22 (37) 
17 (46) 


21 (97) 
9 (171) 


5 12* 


13 (31) 
23 (52) 


1 (68) 
14 (28) 


7 (150) 
21 (118) 


10 


13 14* 


Note.—The figure in parentheses indicates the total absolute frequency for the given group of workers, 
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TABLE 2 


PERCENTAGE OF INSTANCES OF EMPATHIC SUCCESS OF THE WORKER BY TYPE OF ACTION AND 
FACETS OF FOREMAN’S PERCEPTION 


Facet and Value of the 
Foreman’s Perception of Foreman 
26 (42) 
4 (79) 


Direction: positive 
negative 


Difference 


Focus: 
. relations 


job 


Difference 


Activity: active 19 (47) 
passive 10 (100) 


Difference 9 


Note.—Figures in parentheses indicate the total absolute frequencies of dissimilarity cases. 


to different facets. 


Ideal 


\ 
Type of Action 


Actual 


of Foreman 


30 (271) 
0 (66) 


of Worker of Worker 


31 (45) 
0 (277) 


40 (35) 
4 (72) 
36* 30* 3i* 
20 (69) 
11 (70) 


29 (141) 
13 (187) 


14 (109) 
2 (190) 


9 16* 12* 


19 (93) 
11 (80) 


22 (176) 
17 (154) 


9 (114) 
2 (166) 


8 5 ig 


These frequencies differ according 


In this Table each instance of success is counted as a unit, while in Table 1 each worker is counted as a unit. 


Hence the difference in reported frequencies between the two Tables. 


worker is more successful in empathizing him. 
For all the four types of action combined, 
21% of the workers succeed when the fore- 
man is worker-oriented, as against 9% when 
the foreman is not worker-oriented. The dif- 
ference, 12%, is significant at the .01 level.* 

Motivation Ascribed to the Foreman. While 
worker-orientation in the foreman facilitates 
the empathy of the worker, the orientation 
that the worker ascribes to the foreman has 
precisely the opposite effect. When the worker 
thinks that the foreman is worker-oriented, 
he is less likely to succeed in empathizing him 
than when he thinks that the foreman is not 
oriented toward the worker. For all the four 
types of action combined, 7% of the workers 
who perceive the foreman as worker-oriented 
succeed in empathizing, as against 21% when 
the foreman is not perceived as worker-ori- 


2 Empathy scores for different types of action and 
responses to different situations of the same type of 
action are positively correlated among themselves. 
Since these correlations were disregarded in the com- 
putation of the standard error of the difference be- 


tween percentages, the results of the significance 
tests given here are conservative, ie., somewhat 
biased in favor of the acceptance of the null hy- 
pothesis. 


ented. The difference, 14%, is significant at 
the .01 level. Here, too, as for the orientation 
of the foreman arid of the worker, the differ- 
ences for each one of the four types of action 
are in the same direction. 

In summary, the empathic judgment of the 
worker is facilitated when the foreman is ori- 
ented toward the workers and/or when the 
foreman is perceived by the worker as non- 
worker-oriented. Worker-orientation in the 
worker may also be a hindrance. 

Worker’s Empathy and the Perception of 
the Foreman. The results obtained by Baker 
and Block (1957) suggest that it should be 
easier for the worker to judge correctly cer- 
tain perceptive or ideal images of the fore- 
man than to judge certain other images. 
Three component facets of these images are 
supplied by the test with two alternative 
values for each facet: (a) focus of behavior 
(the alternative values of this facet are: to- 
ward interpersonal relations or toward the 
job); (6) direction of behavior (toward the 
focus or away from it); and (c) activity of 
behavior (active or passive). Let us assume 
that the perception of the foreman is different 
in a particular facet from the perception of 
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the worker, the question is then: Is it easier 
for the worker to empathize correctly when 
the perception of the foreman is of positive 
direction than when it is of negative direc- 
tion? The same question occurs, of course, 
for the two other facets of behavior, focus 
and activity. In Table 2 the values of the 
facets of the perception of the foreman are 
plotted against the percentages of instances 
of empathic success of the worker. The table 
shows that: 


(a) The percentage of empathic suc- 


cesses of the worker is always higher when. 


the foreman’s perception is positive (first 
row of the table) than when it is negative 
in direction (second row). All the percent- 
age differences between the positive and 
negative values of the direction facet are 
significant at the .01 level. 

(b) The percentage of successes is higher 
when the perception of the foreman is fo- 
cused on the job rather than on relations, 
but the percentage difference is significant, 
at the .01 level, only for the actual level of 
action. 

(c) The percentage of successes is higher 
when the perception of the foreman is ac- 
tive rather than passive, but the difference 
is significant, at the .01 level, only for the 
actual action of the worker. 


In summary, it seems that when the fore- 
man perceives the action as positive, focused 
on the job, and active, it is easier for the 
worker to judge correctly the foreman’s per- 
ception. 


DISCUSSION 


The results suggest that the motivational 
orientation of the person whose perception is 
predicted, the orientation ascribed to him by 
the predictor, and the characteristics of the 
predicted perception are all related to success 
of prediction. While all these factors seem to 
contribute to the outcome of the empathic 
process, none appears to have a decisive in- 
fluence upon it. Perhaps it should be expected 
that a highly complex process such as inter- 
personal prediction should be mildly related 
to a relatively large number of factors rather 
than being dependent upon very few decisive 
conditions (Foa, 1958a). It is, however, of 


some interest that the results obtained here 
in the field seem to confirm and extend some 
previous findings obtained in experimental 
situations. 

The present data support to some extent 
the earlier findings of Lundy (1956) that the 
orientation of attention of the judge is related 
to empathy. Furthermore, they show that the 
attention of the other person, the foreman, 
also plays a considerable role. More intrigu- 
ing is the finding that the worker’s feeling 
that the foreman is worker-oriented is likely 
to reduce the worker’s chances for a correct 
prediction. This may be explained by sug- 
gesting that such a feeling may indicate satia- 
tion in the worker with regard to the cues 
coming from the foreman (Gewirtz & Baer, 
1958). A satiated worker seems less likely to 
be perceptive toward the foreman, and his 
ability to judge the perception of the latter 
may be consequently reduced. 

These findings suggest some practical con- 
clusions. It seems that understanding is in- 
creased when the foreman is worker-oriented 
but succeeds in avoiding overidentification 
with the worker’s group. The ability of the 
foreman to assume such an attitude may, in 
turn, be influenced by the attitude of manage- 
ment towards him. If the relationship with 
management requires the whole attention of 
the foreman it may be quite difficult for him 
to orient himself towards the workers. At the 
same time, if the foreman feels that his status 
is insecure, he may look to the workers’ group 
for support, thus producing satiation in the 
worker. A management interested in promot- 
ing good communication between worker and 
foreman will, therefore, avoid monopolizing 
the orientation of the foreman, while provid- 
ing him with support. The creation of condi- 
tions enabling the foreman to perform ade- 
quately as a link between management and 
workers seems, therefore, to be primarily a re- 
sponsibility of management. 

The finding that the facets of the perceived 
behavior are related to the success of the 
empathy confirms the earlier results of Baker 
and Block (1957), obtained under laboratory 
conditions. They suggest that empathy is se- 
lective as much as perception is selective 
(Bruner & Postman, 1957). Such selectivity 
may be a result of previous learning. 
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SUMMARY 


This paper presents some findings regard- 
ing the ability of the worker to predict the 
perception of his own foreman. 

The results suggest that the prediction of 
the worker is more likely to succeed when the 
foreman’s attention is oriented toward the 


worker. Success is also more likely when the 
worker believes that the attention of the fore- 
man is not oriented toward the worker. It is 
also shown that the characteristics of the per- 
ception to be predicted significantly influence 
the outcome of prediction. Thus, certain per- 
ceptions are easier to predict than others. 
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The way in which a man is perceived by 
his peers is of considerable practical and theo- 
retical importance. In the armed forces, peer 
status scores are used as parts of grades, as 
promotional considerations, and as thé basis 
for a variety of personnel decisions. However, 
little is known about some of the contextual 
factors which might affect rating scores. One 
of the goals of the present study was to gather 
information about the effects which some 
variables may have on peer ratings. 

More specifically, this study is concerned 
with the extent to which ratings are affected 
by the: 


(1) kind of task assigned to the group 

(2) particular group to which a man is as- 
signed 

(3) number of sessions the group has been 
together 

(4) nature of the trait being rated 

(5) role of the rater (peer or observer). 


PROCEDURE 


The basic experimental unit of this research was 
a group of six Ss brought together for a 2-hr. pe- 
riod and assigned a task. Toward the close of the 
period, each S rated his five peers and himself on 


1The research reported in this paper was sup- 
ported in whole or in part by Personnel Laboratory, 
Wright Air Development Center, Air Research and 
Development Command, United States Air Force, 
under Contract No. AF 41(657)-157. Permission is 
granted for reproduction, translation, publication, 
use, and disposal in whole and in part by or for 
the United States Government. 

Some tables omitted from this paper, the full rat- 
ing scale, and complete descriptions of the tasks are 
available in Fiske, Cox, and van der Veen (1959). 

2We gratefully acknowledge the assistance of 
Ernest C. Tupes, who contributed to the formula- 
tion of the design, and the assistance of Kenneth 
Howard and William Rechenberg, who helped with 
developing the tasks, running the groups, and ana- 
lyzing the data. 

8 Now at Trinity College, San Antonio. 


seven traits. At the same time, these six Ss were also 
rated by six other men who had observed the group 
throughout the period. 

The same group met together four times over an 
eight-day period. Then each group was dissolved 
and Ss were regrouped. Each S’s associates in the 


‘second group consisted of five peers with whom he 


had not worked previously. Each new group also 
performed four tasks. 

At this point, the men who had been Ss became 
observers and the observers became Ss. The process 
was repeated, with the new Ss working together for 
four sessions in each of two groups. 


Tasks 


Each grouping of six Ss worked on four tasks, 
two physical and two verbal. Each set of four tasks 
consisted of two verbal alternating with two physi- 
cal: half the groups had the order P, V, P, V, and 
half had the order V, P, V, P. Examples of the 
physical tasks are: assembling a set of dowels and 
blocks to form a specified structure; rolling a ball 
up an inclined spiral plane; assembling boards to 
form a bridge. Examples of the verbal tasks are: 
making recommendations on national problems; 
playing the game of “Twenty Questions” as a group; 
playing “Intercept,” a war-game problem. 

The tasks were rotated so that each man worked 
on eight tasks and observed eight different tasks. 
When a new grouping of Ss was made, each S had 
had experience with the same set of four tasks in 
his previous group. 

The experimental sessions were held in classrooms. 


Observers 


All six @s observed and rated all six Ss. However, 
each O was the “primary observer” for one S. He 
was instructed to watch him most closely. Every 15 
min., he rated this one S on his activity level and 
made notes about what his S had done during that 
quarter-hour and about his impressions of S. The 
principal function of this procedure was to maintain 
O’s attention. 

When the six Ss were regrouped, the primary O 
followed his S to the new group. Thus the primary 
O observed the same man for eight sessions but ob- 
served other Ss for only four sessions. 


Subjects 


The basic design called for 144 men. In each of 
the two replications of the design, six groups of six 
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men were observed by 36 observers. As indicated 
above, the Os and Ss reversed roles after two sets of 
four sessions each. 

A few men failed to complete the entire series and 
had to be replaced. It was possible, however, to ob- 
tain essentially complete data on 138 men. 

The Ss were students at the University of Chicago 
recruited by advertisements. They were paid for each 
session. Attendance was maximized by exhortation 
and by offers of bonuses for those completing the 
entire schedule. 

Their mean age was 20.7 years with a range from 
17 to 29. Their mean education was 2.6 years of col- 
lege with a range from just entering college to three 
years of graduate work. 

Before the group sessions began, Ss were given a 
battery of psychological tests in three testing ses- 
sions. They were also given a list of all Ss and asked 
to indicate those whom they knew. From this infor- 
mation, schedules were meticulously worked out so 
that the six men assigned to any one of the 48 work- 
ing groups had no previous acquaintance with each 
other, and so that the six Os had no previous ac- 
quaintance with any of the six men they rated. The 
scheduling could not be perfect: in a few instances, 
an S was known slightly by one of the Os who rated 
him, but never by his primary O. 


Traits 


Of the seven rated variables, four were evaluative 
and referred to the rater’s over-all impression at the 
time of rating: the rater was asked what he thought 
the ratee was like, in general, without specific refer- 
ence to the observed behavior. 

The rating of Liking (No. 1) was designed to get 
at the rater’s general evaluation of the man. (Ss did 
not rate themselves on this trait.) Leadership Poten- 
tial (No. 4) was included because of its importance 
in military personnel work. Maturity (No. 2) and 
Breadth of Interests (No. 3) were selected to ap- 
proximate two factors which have been found in 
analyses of peer rating scales. 

The other three were descriptive of the S’s be- 
havior in each session: 5. Number of Useful Ideas, 
6. Considerateness, and 7. Assertiveness. For these, 
the rater was instructed to rate on the basis of what 
the man did in that particular session. The descrip- 
tive traits were adapted from the three factors which 
Carter (1954) has found to recur in analyses of 
ratings on behavior in groups: “aiding attainment 
by groups,” “sociability,” and “individual prominence 
and achievement.” The findings of Borgatta, Cottrell, 
and Mann (1958) lend further support to the selec- 
tion’ of these traits. 


Analyses 


The approximately 100,000 ratings were punched 
into IBM cards and most of the computational work 
was carried out on IBM machines. 

The basic unit was the score for an S on a trait 
for a session. Three such scores or basic ratings were 
obtained: (a) the mean of the ratings by a man’s 
five peers in his working group; (6) the rating by 


the primary observer; (c) the mean of the ratings 
by the other five observers. (Whenever the unquali- 
fied term “observer” is used, it refers to these five 
raters.) The primary observer’s rating was excluded 
from the O mean so that the latter would be com- 
parable in every respect except role with the peer 
mean. 


Most of the analyses were carried out for each of 
these three ratings on each of the seven traits sepa- 
rately. 

RESULTS 
Effect of Group 


Findings on the effect of group will be re- 
ported first because this effect proved to be a 
major one, which has to be considered when 
studying other effects. 

The basic analysis for the effect of group 
was the intercorrelation (over 138 men) of 
the mean rating per trait for each of the eight 
sessions, arranged in temporal order. Thus the 
intercorrelations within each of the two groups 
(Sessions 1-4 and Sessions 5-8) could be 
compared to the intercorrelations between 
groups. The medians are reported in Table 1. 

It is obvious that the correlations involv- 
ing the same group are substantially higher 
than those between different groups, for the 
peer and O ratings. In fact, for Traits 1-4, 
there is no overlap between the distributions 
of the correlations. The effect of “group” 
could be attributed to changes in Ss’ behavior 
as a function of working with different peers 
or to changes in perceptions as a function of 
change in raters. A glance at the correlations 
for the primary observer, who was the same 
rater for both groupings, indicates that the 
“group” effect is primarily a function of the 
change in raters: when the rater is the same, 
the several ratings for sessions in the second 
group correlate as highly with ratings for the 
first group as the latter do with each other. 


Effect of Task 


To assess the effect of type of task, each 
basic rating was identified by the type of task 
involved, and secondarily by the group (first 
or second grouping for that man) and the 
serial order (first or second task of that type 
within the series for the group). These rat- 
ings were then intercorrelated to indicate the 
relationship between each pair of sessions so 
identified. 

These correlations obviously involve the 
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TABLE 1 
MEDIAN INTERCORRELATIONS BETWEEN SESSIONS INVOLVING THE SAME AND DIFFERENT GRouPS 


Peer ratings (V = 138) 
First group 

Second group 

First vs. second 


Observer ratings (V = 138) 
First group 
Second group 
First vs. second 


Primary observer ratings (V = 108) 
First group 54 
Second group 
First vs. second 58 


same data as those in Table 1, but combined 
in a different way. For example, the first 
physical task encountered by each S was the 
first task chronologically for half the Ss and 
the second chronologically for the other half. 

Inspection reveals no systematic tendency 
for the correlations between sessions involv- 
ing different types of task to be any lower 
than those between the same type of task. In 
other words, the consistency of peers’ percep- 
tions of S’s behavior in different sessions is 
not affected by the type of task (physical vs. 
verbal). 

Of some interest is the effect of type of 
task on mean ratings. For verbal tasks, the 
means of the peer ratings are higher on all 
traits, while the means for O ratings are 
higher on all traits except Traits 1 and 6 (the 
means for primary O are higher only on 
Traits 3, 4, 5, and 7). The differences are 
trivial except for Trait 7 (Assertiveness) and 
Trait 5 (Number of Useful Ideas). 

The major psychological difference between 
the physical and the verbal tasks was in the 
nature of the interpersonal interactions. In 
verbal tasks, everyone interacted continuously 
with everyone else: each person was either 
speaking or listening to a peer. 

In contrast, Ss could not sit for most of the 
physical tasks. Much of the time they moved 
around, collecting and arranging materials. It 
would appear that the greater amount of sub- 


Traits 


.59 52 


oe 41 


stantial interaction contributed to the higher, 
more positive ratings in the verbal tasks, and 
especially for Traits 5 and 7 which apply to 
aspects of overt actions visible to the whole 
group. 


Effect of Time 


Correlations were computed between the ses- 
sions in temporal order. The correlations 
among the four sessions of one group are gen- 
erally of about the same magnitude: no ma- 
jor effects are present. 

There is a slight tendency for the correla- 
tions between the third and fourth sessions in 
each group to be the highest ones in each 
block, but this trend occurs chiefly in the 
peer and observer correlations for the first 
four traits where the correlation between the 
third and fourth sessions always exceeds the 
correlation between the first and second by 
a small amount. No other general trend 
emerges. While it might be expected that 
temporally adjacent sessions might show ele- 
vated correlations, such does not seem to be 
generally true. 

In the mean ratings on each trait for each 
temporal session, no major trends are evident. 


Effect of Trait 


Are the several effects previously examined 
different for different types of traits? Table 1 
shows a clear difference. With but one excep- 
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tion, the correlations between sessions in a 
given group are higher for evaluative traits 
(1-4) than for descriptive (5-7). Thus it ap- 
pears that these general evaluations are more 
consistent across sessions than these specific 
descriptions of S’s behavior. This effect disap- 
pears in the between-group evaluations, where 
the general impressions of different raters 
agree no more highly than do their cescrip- 
tions of behavior. 

The same general picture appears when the 
sessions are grouped by task. However, in 
both analyses, primary Os’ median correla- 
tions for evaluative traits are consistently 
larger than those for. descriptive traits in the 
cross-group comparisons. A primary O’s gen- 
eral impression was affected very little by 
whatever changes in S’s behavior occurred 
when he was placed in a different group. 
Hence the change in raters (for peer and O 
ratings) leads to a change in description of 
an S’s behavior and to an even larger change 
in evaluations. 

The a priori distinction between evaluative 
and descriptive traits was reinforced by the 
instructions on the rating scale. Nevertheless, 
the several traits showed two groupings which 
cut across this dichotomy. In several analyses, 
the data for Considerateness were more simi- 
lar to those for Liking, Maturity, and Breadth 
of Interests than to the data for the other 
two descriptive traits. Ratings on these four 
traits required indirect inference from the ob- 
served behavior. 

In contrast, Assertiveness and Number of 
Useful Ideas involved judgments or evalua- 
tions of observed behavior. Apparently Lead- 
ership Potential was also rated rather directly 
from observation, since its data resemble 
those for these two descriptive traits. 

The means for these latter three traits tend 
to be lower and the standard deviations higher 
than the corresponding values for the four 
more inferential traits. Again, peers and Os 
agreed less well on the four inferential traits. 
(Further evidence on trait differences is given 
in Fiske [1960], and van der Veen and Fiske 
[1960].) 


Discrepancy between Rater and Group Mean 


The extent of agreement between the mean 
ratings and those made by individual raters 


was explored. For each peer, the absolute dis- 
crepancy between his rating and the mean 
peer rating was calculated for each rating he 
made on each trait on each other peer in each 
situation. These were averaged over the seven 
traits to obtain his discrepancy for each S in 
each session. For each session, the mean and 
the variance of his discrepancies for the other 
five Ss were computed. Comparable analyses 
were made for O ratings. 

Each rating was, on the average, within 
one-and-a-half steps (on the 10-step scale) 
of the mean of four or five other raters. This 
mean discrepancy suggests that somewhat dif- 
ferent frames of reference existed in the dif- 
ferent raters, and that for a reliable rating by 
peers, a number of raters should be averaged. 

The analyses of peer discrepancies were 
based on 48 men who participated in eight 
sessions in which no peer was absent. It was 
deemed advisable to use a smaller sample for 
whom complete data were available than to 
use more cases with many estimated values. 

An analysis of variance of the mean peer 
discrepancy scores per session yielded a sig- 
nificant interaction (p< .01) between rater 
and group (first vs. second). Thus the size of 
a peer’s mean discrepancy depended upon the 
particular group with whom he was working. 
When the mean square for raters was tested 
against the mean square for interaction (Rater 
x Group), the F was barely significant at the 
.O5 level. The presence of individual differ- 
ences in discrepancy scores was no surprise 
because individual differences in raw ratings 
had been noted in inspecting the original data. 
Since individual differences between raters ex- 
isted, it seems reasonable to interpret the sig- 
nificant interaction in terms of rater varia- 
tion, rather than ratee variation: a rater’s 
discrepancy from a group mean is a function 
of the raters with whom he is compared. The 
people being rated probably make much less 
difference. 

No significant effect for group (first vs. sec- 
ond) was found. 

In a separate analysis of variance, the mean 
discrepancies were ranked in chronological or- 
der within each group. No effect of serial or- 
der was found. 

There was no mean difference between peers 
who had been in the role of observers previ- 
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ously and peers who had not: mere experi- 
ence in making ratings did not affect the av- 
erage discrepancies. 

The variance over each man’s five peers 
was also analyzed. Since the variances were 
badly skewed, a nonparametric technique 
using chi square was employed (Mood, 1950, 
Ch. 16). The interaction (Rater x Group) 
was again significant but the rater and group 
effects were not significant by themselves. 


Effect of Role 


In the analyses presented above, few dif- 
ferences between peer ratings and O ratings 
have been found. One effect associated with 
the peers’ personal interaction with each other 
is evident in the mean ratings. On every trait, 
the mean peer ratings are above the mean for 
observers or primary observers. (The means 
over all traits are 5.54, 5.11, and 5.03, re- 
spectively.) 

The primary observer ratings seem to re- 
semble the observer ratings. No effects seem 
to be associated with the primary O role 
which cannot be explained by the lower re- 
liability of one rating (as against the mean 
of five) and by the fact that, in the second 
grouping, this man saw the same man he had 
seen before and about whom he had had time 
to develop a rather firm impression. 

The agreement between peer and O ratings 
was investigated. For each of the 192 indi- 
vidual sessions, the peer means and the O 
means were ranked over subjects on each 
trait. These ranks were correlated, using rho. 

The 1,334 values were summarized by trait, 
and subdivided in various ways. No signifi- 
cant effect was found for group (first or sec- 
ond) or for cycle (having had previous ex- 
perience in the opposite role or not). Thus 
again, no effect of experience or time ap- 
peared. 

For each type of task, a study of trait dif- 
ferences was made by ranking the median 
correlations of the seven traits within each 
specific task. Significant trait differences (p < 
.001) were found in each case, using an analy- 
sis of variance of ranks. 

The median correlations for each type of 
task and for all sessions are given in Table 2. 
The medians show essentially the same order 
(rho = .78). As discussed earlier, the trait 


TABLE 2 


MEDIAN CORRELATIONS BETWEEN PEER AND OBSERVER 
RATINGS FOR SEPARATE SESSIONS * 


Trait 
Liking 40 

Maturity 54 

Breadth of interests 54 
Leadership potential 42 
Number of useful ideas ofa j 73 
Considerateness a 49 
Assertiveness 83 84 


Physical 


Verbal 


*® The medians are based on rank correlations for each of 192 
sessions, half of each kind. For each session, N = 6. 


differences are presumably associated with 
the relative specificity of behavioral referents 
in the trait descriptions. Of most interest 
are the high agreement on Assertiveness and 
the comparatively poor agreement on Con- 
siderateness. 

The trait differences are more important 
than the general level of the medians. The 
agreement between raters is largely a func- 
tion of the trait being rated. 

The degree of agreement between peers and 
Os can also be studied by comparing different 
sessions. It was found that ratings by peers 
and Os from different sessions agreed better 
for Assertiveness than for Maturity and Con- 
siderateness. 


Differentiation of the Traits 


The analyses reported above have deter- 
mined the influence of various factors upon 
the peer ratings. This section will consider 
the effect of general impressions as contrasted 
to specific trait designations. 

For the mean ratings in the second group- 
ing, intercorrelations were run for peers and 
for Os separately. For each role, the four 
evaluative traits had intercorrelations be- 
tween .69 and .85. The other values were gen- 
erally lower. The median for all correlations 
was .70; from this, it may be estimated that 
the median intercorrelation for ratings made 
in a single session would be .37. 

The generally high level of these intercorrb- 
lations among traits accounts for much of the 
essential homogeneity of the results for the 
several traits as reported in earlier sections 
(cf. Table 1). On the other hand, the differ- 
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ences between the peer—observer correlations 
for the several traits (cf. Table 2) become 
even more striking since they emerge in spite 
of this high overlap between the traits. 

Another question is the extent to which the 
peers and Os could jointly identify and dif- 
ferentiate several traits. To study this ques- 
tion, the mean ratings of peers and of Os 
for each group were intercorrelated on four 
traits, yielding a multitrait-multimethod ma- 
trix (Campbell & Fiske, 1959) in which the 
sets of peers and Os in the two separate 
groups represent four independent “methods.” 

There is reasonable agreement between 
“methods” when the peers and Os are ob- 
serving the same behavior, or when ratings 
from the same role (peer or O) but different 
groups are correlated. For these comparisons, 
the differentiation of traits is fair. However, 
when peers in one group are correlated with 
Os in another group, we tind lower agreement 
and much poorer differentiation. In all com- 
parisons, the differentiation between Leader- 
ship Potential and Assertiveness was particu- 
larly bad. 

Thus dissimilarity between raters in identity 
and role and differences between the behavior- 
samples observed are factors contributing to 
lower agreement between ratings on a given 
trait and to increasingly poorer discrimination 
between traits. 


DISCUSSION 


From:the results, one general finding 
emerges Clearly: the rating given an S is in- 
fluenced considerably by the particular men 
making it. Furthermore, this effect is prob- 
ably a function of the different frames of 
reference used by different raters. Thus the 
individual differences in mean rater discrep- 
ancies seem to account for the effects asso- 
ciated with group. 

In all probability, this individual rater ef- 
fect is a function of his average rating. While 
Ss as a sample produced ratings with a mean 
only slightly above the center of the scale, 
some Ss concentrated their ratings in the top 
steps, while others spread them out over the 
entire 10 steps. 

While Ss did work seriously on the tasks, 
the typical level of involvement was not high. 
None of the tasks involved Ss’ major areas of 


specialization, so that there was little chal- 
lenge to prove their competence. Most Ss did 
not exert maximum effort, in spite of the prize 
money offered for the groups performing best. 

The general setting which produced this 
moderate level of involvement in the tasks 
produced a similar—or lower—level of in- 
volvement in the ratings: the Ss knew that 
their ratings would have no personal effect on 
themselves or on the peers they rated. 

Thus there were several factors in this ex- 
periment which worked together: the con- 
sistency of the experimental conditions from 
session to session; the moderate to low in- 
volvement of Ss; the absence of feedback (the 
raters did not know how their ratings com- 
pared with those made by others). The total 
effect of these factors probably contributed to 
the consistency of the ratings over sessions, 
over types of task, and over roles. 


SUMMARY 


Men were brought together in groups of 
six to work on assigned verbal or physical 
tasks. Each group spent four sessions together. 
The men were then reassigned to separate 
groups with which they worked for four more 
sessions. At the end of each session, each man 
rated his five peers and himself on seven 
traits. The four evaluative traits were Liking, 
Maturity, Breadth of Interests, and Leader- 
ship Potential. Ratings on the descriptive 
traits-—-Number of Useful Ideas, Considerate- 
ness, and Assertiveness—were based on what 
the man did in that session. The men were 
also rated by six Os. New Os rated the men 
in the second group, except for one primary 
O who followed the same man through all 
eight sessions. 

Intercorrelations among the mean peer rat- 
ings and among the mean observer ratings per 
session yielded the following findings: 1. A 
change in the group with which a rater 
worked was associated with a change in the 
ratings he received. (Evidence is presented to 
support the interpretation that this change 
in rating was primarily a function of the fact 
that different raters rated him in the two 
groups.) 2. The type of task assigned to the 
group had no important effect on the ratings. 
3. The number of sessions the group had been 
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Consistency of Ratings by Peers 


together had no important effect on the rat- 
ings. 4. No general effects associated with 
rater role as peer or O were found. Peer rat- 
ings were, however, systematically more posi- 
tive than those by Os. 

It was also found that agreement between 
peers and Os on ratings in a single session 
was highest for Assertiveness, high for Num- 
ber of Useful Ideas and Leadership Potential, 
lower for Breadth of Interests, Maturity, and 
Considerateness, and lowest for Liking. 

Discrepancies between the ratings made by 
a man and the mean of ratings made by others 
in the same role showed reliable individual 
differences. 

It was concluded that individual differences 
in rater frames of reference contribute to the 
differences between the ratings a man receives 
in different groups. In comparison to this ef- 
fect, such factors as type of task, rater’s role, 
and number of previous sessions have little 
effect on ratings under the conditions of this 
experiment, in which there was only moderate 
involvement on the part of Ss and no oppor- 
tunity to compare one’s ratings with those 


made by others. It was also concluded that 
the degree of agreement between raters is a 
function of rating instructions and trait defi- 
nition. 
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If measurable personality variables are re- 
lated to behavior of drivers, then persons re- 
sponsible for training and licensing drivers 
should consider how this relationship can be 
used in order to reduce the number of high- 
way accidents. Accidents result from a com- 
plex involving, in addition to the driver, the 
machine, the highway, and driving conditions 
such as the weather and the time of day. Any 
systematic programs that will effectively re- 
duce the number of accidents must pay at- 
tention to all of these variables, but to date 
too little attention has been given to the per- 
sonality of the driver. 

In a study of traffic accident repeaters in 
Michigan and Connecticut (ENO, 1948), the 
Cornell Word Form, a type of personality 
test, distinguished to some extent between 
drivers with and without traffic accidents. Re- 
lationships were small. Harris (1950) found 
no significant differences between accident- 
prone and non-accident-prone employees using 
a multiple-choice Rorschach, the Rosenzweig 
Picture-Frustration Test, and 71 Bernreuter- 
type items. A military aviation study (O’Gor- 
man & Kunkle, 1947) found some statistically 
significant and suggestive differences on the 
MMPI among three groups of pilots with 
different accident records. Of a no-accident 
group 12% had Pd standard scores of 60 or 
above and 2% Ma standard scores of 75 or 
above. Of a group with accidents due to vio- 
lating rules, 20% had Pd scores of 60 or 
above and none had Ma scores of 75 or above, 
but of a group having accidents due to taxi- 
ing with carelessness, 16% had Ma scores of 
75 or above. 

The results that are available from studies 


1 This research was done with the cooperation of 
the Minnesota State Highway Department. The opin- 
ions and assertions made, however, are those of the 
writers and should not be construed as official or rep- 
resenting the views of the Highway Department. 


DRIVER BEHAVIOR AND SCORES ON THE MMPI* 
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of relationships between personality and acci- 
dents are far from conclusive but are intrigu- 
ing. 

The purpose of this study was to determine, 
after a period of four to six years, if there is 
a relationship between the accident and vio- 
lation records of drivers and their scores on 
the Minnesota Multiphasic Personality In- 
ventory. 


METHOD 


The subjects of this study consisted of 993 resi- 
dents of Minnesota who were registered as freshmen 
in the General College of the University of Minne- 
sota in 1951, 1952, or 1953. This college offers a two- 
year program in general education, and the students 
in the college tend to be comparable in terms of 
ability and high school performance to the total 
group of high school graduates in the state. Most 
General College students come from the Minneapolis- 
St. Paul area, and most fall within the age range of 
18 through 22. The mean age of the group was about 
19 at the time the MMPI was administered. 

During the three years, approximately 2,000 stu- 
dents were registered as freshmen in this college. 
Complete information regarding both personality in- 
ventory scores and driving records was available for 
slightly over 1,000, and as relatively few persons for 
whom such information was available were women, 
the study was restricted to men. The difficulty of ob- 
taining driving records for women was due to the 
change of women’s names resulting from marriage. 

When a student enters the General College, he is 
expected to take a group of tests, including the 
MMPI. The proportion of entering students who 
actually take this test varies from year to year from 
70 to 90%. The names of entering students were ob- 
tained, the students identified for whom test scores 
were available, and then these names were checked 
against the records of the Minnesota State Highway 
Department. From the driver’s license application, 
form in this department, the age, address, and date 
of issuing the driver’s license were obtained. From 
the driver’s record packet in this office was obtained 
information regarding violations, including date of 
violation, type of violation, place of violation, and 
department action taken against the driver, along 
with information about accidents which included 
date of accident, number of accidents, and the acci- 
dent records file number. 


5 
: 
at 
at 
a 
4 
18 
| 
wa 


Driver Behavior and the MMPI 19 


After the frequency distributions of the number of 
accidents and number of violations were obtained, 
three groups of 100 individuals each were selected 
from the 993 males for more intensive study. The 
first group consisted of the 100 persons with the 
greatest number of violations and accidents. In this 
group were persons who had five or more violations 
and/or three or more accidents. The low group con- 
sisted of 100 persons selected from the total group 
having no violations and no accidents on their rec- 
ords. A middle group of 100 persons was selected 
who fell at about the mean of each distribution, and 
these were people who had one and two violations 
combined with one accident. Frequency distributions 
were then tabulated, and the mean raw scores on the 
MMPI were compared for these three groups. The 
proportion of persons in the low group and the high 
group answering each question on the Pd (Psycho- 
pathic Deviate) scale and the Ma (Hypomanic) scale 
was determined, and the groups were compared in 
order to select items which differentiated between 
the groups. 

A questionnaire was prepared and sent to each 
person in each of these three groups. Included on 
this questionnaire were questions asking for the num- 
ber of miles driven per year, the number of years 
the person had been driving, the number of acci- 
dents the person had had, and the number of traffic 
tickets other than for parking. Two follow-ups to 
the original questionnaire resulted in a return of ap- 
proximately 80%. 


TABLE 1 


FREQUENCY DISTRIBUTIONS OF NUMBER OF VIOLATIONS 
AND NUMBER OF ACCIDENTS 


Violations 


Accidents 


Number 
of Acci- 


dents 


Number 
of Viola- 
tions 


Number 
of Men 


Number 
of Men 


321 
229 
155 
107 

72 


Total 


Total 993 


Note.—Mean violations = 1.88. Mean accidents = 1.09, 


TABLE 2 


MEAN Raw Score ror Eacu ScaLe or tHE MMPI 
FoR Hicu, Meprium, aNp Low Groups 


Mean 


Low Middle High 
Group Group Group 


1.93 42 
3. 3.41 3.27 
4.91 5.81 

14.82 14.75 

13.08 12.56 

18.54 17.71 

19.73 19.24 

23.29 23.99 

22.58 21:70 

8.98 9.11 

27.28 

27.99 

21.44 

22.81 


RESULTS 


Table 1 presents the number of violations 


and the number of accidents for the 993 


drivers as shown in the Highway Department 
records. Approximately one-third of the driv- 
ers had no violations, more than 10% had 
five or more violations. More than one-half of 
the drivers had at least one accident, and 
well over 10% had at least three accidents. 
The mean number of violations was 1.9, and 
the mean number of accidents was 1.1. 

It must be noted here that many of these 
drivers had been driving for at least two or 
three years before they took the MMPI, and 
the number of violations and accidents in- 
cludes all of those reported in Minnesota, some 
before and some after the time of testing. 

Table 2 shows the mean scores on the 
scales for the three separate groups. In gen- 
eral, the means are not significantly different 
with the exception of the Pd and Ma scales, 
where the poor drivers tend to obtain ele- 
vated scores. 

The correlations between the number of 
accidents and the number of violations and 
scores on the MMPI are shown in Table 3. 
With a group of 993 persons, a correlation of 
.08 is significant at the .01 level. Here it is 
seen that the significant correlations with Pd 
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TABLE 3 


CORRELATION COEFFICIENTS FOR NUMBER OF VIOLA- 
TIONS AND NUMBER OF ACCIDENTS FOR 
Eacu SCALE or THE MMPI 


Correlation Correlation 


Coefficient Coefficient 
of Scale and of Scale and 
Scale Violations Accidents 

—.051 .007 
F O88 .079 
K .000 — .038 
Hs — .005 .000 
D —.029 — .007 
Ily .038 002 
Pd 101 083 
Mf — .040 — .032 
Pa .040 .046 
Pi — .004 — .027 
Sc .000 
Ma 075 097 
Si — .078 — .006 


and Ma scales are so small that they have 
little practical value. 
Examination of the profiles suggests that 


the obtained differences in the group analyses. 


are small because the group of poor drivers 
contains a number «i different kinds of profile 
groups or personality types. One might specu- 
late, for instance, that one kind of poor driver, 
perhaps characterized by a high Pd score, is 
the driver who doesn’t conform to much of 
anything, who doesn’t have respect for the 
rights of other individuals or groups, who is 
quite completely self-centered. Another kind 
of driver well might be one who is extremely 
hostile toward other persons, perhaps as shown 
by a relatively high Pa score. His driving be- 
havior may be motivated by a desire to get 
even with the next driver or to show the next 
fellow. A third type of driver might be one 
with an elevated Ma score, the driver who is 
in a great hurry and whose main interest is 
in reaching his destination as soon as possible, 
or sooner. When scores for these different 
types of drivers are combined, the elevated 
scores of one group tend to be compensated 
for in group analysis by the depressed scores 
of other groups, and in general the type of 
group analysis done here reveals little. 

The self-reports of the number of years of 
driving and the number of miles driven per 


year were compared for the three groups. A 
chi-square test showed that the poor driv- 
ing group reported they had been driving a 
slightly significant greater number of years 
than the good driver group, but the differ- 
ence in terms of number of miles per year 
was not significant. The number of accidents 
and violations in the self-reports were com- 
pared to those in the Highway Department 
records, and a strong and consistent tendency 
was found for drivers to report fewer acci- 
dents and violations. The mean number of 
accidents recorded for the poor drivers was 
3.49, the mean number they actually reported 
themselves was 2.77, an understatement of 
21%. The mean number of violations re- 
corded was 5.88, and the mean of the per- 
sonal reports was 3.98. an understatement of 
33%. 


CONCLUSION AND DISCUSSION 


The results indicate that there is a small 
but observable relationship between scores on 
the MMPI and the number of accidents and 
violations of drivers. The observed differences 
suggest that a list of test items might pro- 
vide some assistance in identifying drivers 
prone to have accidents because of their own 
personality characteristics. 

If persons can be identified who have dif- 
ferent types of personalities related to acci- 
dents and violations, perhaps the actual oc- 
currence of the accident or violation can be 
reduced simply by calling to the attention of 
the individual the fact that he has this pat- 
tern of characteristics. Perhaps more intensive 
instruction would be necessary and it may be 
that certain persons can be identified, in 
driver training courses or through the means 
of driver’s license examinations, who require 
more extensive counseling or perhaps even 
psychotherapy in order that they understand 
their own motivations sufficiently to control 
them when they are at the wheel. 

Another possibility is suggested here by the 
difference in the self-reports and the actual 
recorded number of accidents and violations. 
Drivers who tend to under-report their viola- 
tions may not be lying, they actually may be 
reporting themselves as they perceive them- 
selves. Most persons perceive themselves as 
good, competent drivers, particularly young 
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men in this age range. Driving well is one of 
the things almost all young men feel them- 
selves capable of doing. Any experience that 
tends to threaten this self-perception may 
arouse many defenses, and the most obvious 
kind of evidence that a person may not be as 
good a driver as he believes, that is, actual 
arrest or traffic tickets, may be quickly shut 
out from awareness or repressed so that the 
person can maintain the favorable perception 
he has of his own driving skill. The first prob- 
lem with persons who have poor driving rec- 
ords may be to get them to realize they are 
poor drivers. 

Adler (1941) proposed on the basis of a 
clinical study of 130 industrial accident cases 
five personality types prone to have repeated 
accidents. These were: 


Persons with revengeful attitudes 

Persons considering themselves as unlucky 
Persons longing to be pampered 

Persons who are overly ambitious 

Persons who are overly fearful 


The concept of a personality type which is 
accident prone is difficult to defend, bui the 
concept of habitual patterns of attitudes and 
behaviors which contribute differentially to 
the occurrence of accidents finds some sup- 
port in this study. 

An attitude of unconcern about others, with 
a failure to benefit from experience, accom- 
panied by a disdain for rules and regulations, 
the Pd syndromes on the MMPI, well might 
be related to accidents. 

Another set of attitudes more directly in- 
volves feelings of hostility. Persons who ha- 
bitually respond to others with suspicion, 
who view every social situation as competi- 
tive, who strive*for dominance in every in- 
stance, may as a result of these attitudes 
have accidents. This is the Pa syndrome. 


Persons with high Ma scores may value 
effort and movement for their own sake, may 
pay little attention to social consequences of 
their acts, and may have accidents. 

Not all persons with these attitudes will 
have accidents, for the results of some of these 
attitudes can be suppressed by other atti- 
tudes. The Pa syndrome may be suppressed 
by a tendency to withdraw from others, or 
by a fear of injury. None of these speculated 
relations are simple; rather, they are only 
suggestive. 

SUMMARY 


A group of 993 men was administered the 
MMPI when they were freshmen in college. 
From four to six years after this, their driv- 
ing records were investigated, and a very 
small but statistically significant relationship 
was observed between scores on the Pd and 
Ma scales of the MMPI and the number of 
accidents and the number of violations of 
these individuals. The Small size of these re- 
lationships mizht be due to the existence of 
more than one kind of personality pattern re- 
lated to poor driving. Knowledge of the kind 
of personality organization and motivation of 
a driver may be useful for purposes of both 
licensing and training drivers. 
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This investigation was designed to study 
the relation between company policies and 
supervisors’ attitudes toward supervision in 
two companies, one of which was character- 
ized as having democratic management, the 
other, an authoritarian management. 


RESEAkCH DESIGN 


Selection of the two companies. Two industrial 
organizations were selected for detailed investiga- 
tion because from a preliminary study of anecdotal 
data, firsthand observations, and.a tentative analy- 
sis of personnel policies, it appeared that these two 
companies differed substantially in their approach to 
management. Both companies are leaders in their 
respective industries and maintain their executive 
and administrative offices in a large metropolitan 
city along the eastern seaboard. The democratic 
company is a medium sized organization engaged in 
the metal fabricating industry and operates a number 
of manufacturing plants in various parts of the 
country. Its executive office, comprising approxi- 
mately 70 employees, is primarily concerned with 
administrative and managerial functions. The au- 
thoritarian company is one of the nation’s largest 
operating and holding companies with varied and 
diversified interests throughout the United States as 
well as in several foreign countries. Approximately 


1 This paper is based upon a portion of a Ph.D. 
dissertation submitted to Teachers College, Columbia 
University. The author wishes to acknowledge the 
guidance of his advisers, Albert S. Thompson, Rob- 
ert L. Thorndike, and Kenneth F. Herrold. 


Factor Democratic Concept 


Philosophy 


Management 


(2) 


Supervision 
through his subordinates. 
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(1) An industrial organization must show a 
profitable operation, but it also has a broad 
of obligation to its employees and to the public. 


A supervisor should give leadership and 
Concept guidance to his employees and build an effi- 

of cient and satisfied cooperative work team. 
This is accomplished by working with and 


2,006 persons are employed in the company’s cor- 
porate office and are engaged in carrying out duties 
similar to those performed in the democratic com- 
pany. 

Personnel policies in the two companies. In each 
of the two companies, personnel policies currently 
in practice were subjected to a detailed content 
analysis. This was done by studying written state- 
ments, administrative decisions, manuals, annual re- 
ports, and letters to employees and stockholders as 
well as stated goals and objectives. In addition to 
investigating personnel policies appearing in com- 
pany literature, management officials in both organi- 
zations were interviewed using an unstructured in- 
terview. All data gathered in this fashion were clas- 
sified into numerous categories. This was then fol- 
lowed by an investigation of what appeared to be 
significant differences as well as similarities between 
the two companies. Although there were fundamen- 
tal differences in personnel policies between the two 
companies, there were also numerous basic similari- 
ties. Of primary significance was the finding that 
in both companies consideration for the employee 
was accorded major importance. In both the demo- 
cratic and the authoritarian companies management 
officials were most emphatic in stressing the impor- 
tance of treating their employees with maximum 
consideration. In addition, both organizations pro- 
vided favorable working conditions, attractive sala- 
ries and advantageous benefits for their employees. 

Differences between the two companies. Besides 
similarities found in both companies, the content 
analyses of policies of the two organizations dis- 
closed substantial differences which are grouped un- 
der six factors: 


The function of a commercial enterprise is 
basically to show a profitable operation. All 
other considerations are secondary. 


A supervisor should develop his employees 
to the extent that they understand exactly 
what is expected of them, so that they will 
be able to perform their work efficiently. 
Emphasis is placed on the supervisor’s own 
personal qualifications as evidence of leader- 
ship ability. 
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Differences between the Two Companies—Continued 


(3) 


Joint 
Participation 


Factor 


Democratic Concept 


Authoritarian Concept 


Employees participate with management 
in defining the goals and policies of the com- 
pany as well as take part in the planning of 


All policy formulation is decided upon by 
management without employee participation. 
Work assignments are planned by depart- 


the work of their own departments. 


(4) Employees are entitled to a great deal of 
information abéut the company and its ac- 
In order to foster the spirit of co- 
operation, the better informed the employees 
are, the better they can contribute to the 


Releasing 
of tivities. 
Information 


progress of the company. 


(5) The employee’s individuality is recognized 


Recognition 
of the 
Individuality 
of the 


Employee 


and respected. 


applied in dealings with employees. 


(6) A general flexibility is prevalent. 


Rigidity- 
Flexibility 
circumstances into account. 


Persons are not alike in all 
ways and the law of individual differences is 


ployee rules, regulations, and policies are 
more flexibly interpreted, taking individual 


ment heads and then distributed to each 
employee. 


Information should be restricted to a select 
group of management officials. Employees 
need to know only enough information asis 
required for the successful performance of 
their duties. 


Persons are regarded as essentially alike 
and consequently must be treated in a uni- 
form manner according to established rules 
and regulations determined by management. 


Em- A general lack of flexibility is prevalent. 
Employee rules, regulations, and policies are 
rigidly stated, and deviations occur rarely 


and only under very unusual circumstances. 


Raters’ evaluation of personnel policies in the two 
organizations. Because of the unavailability of an 
established scale of democratic-authoritarian man- 
agement, it was necessary for the investigator to 
construct has own scale based on a content: analy- 
sis of personnel policies in each of the two com- 
panies. This introduced a regrettable, but unavoid- 
able, element of subjectivity. To some extent, how- 
ever, it was felt that this subjectivity could be sig- 
nificantly reduced if the data obtained from the 
content analyses of the two companies’ policies could 
be evaluated by a group of impartial raters. Ac- 
cordingly, the data that appeared to differentiate the 
two companies were summarized into 60 matching 
statements, half describing the democratic company 
and the other half referring to the authoritarian com- 
pany. The statements were then categorized into 
the six factors previously defined and presented in 
an analytical and nonidentifying manner to five 
raters, each of whom was actively engaged in the 
field of personnel psychology. Without knowing to 
which company any given item referred, the raters 
were asked to evaluate the data presented using a 
7-point rating scale along a democratic-authoritarian 
continuum. Average ratings for the 60 statements 
were then computed and tested for significance. The 
difference in mean scores between the two com- 
panies was found to be significant far beyond the 
01 level. Apparently the raters agreed with the in- 
vestigator that, based on the statements presented to 
them as a result of the content analyses, the two 
companies did differ. 


Measuring instruments. The two instruments used 
in this investigation were the Supervisory Prac- 
tices Inventory, Form D (Nelson, 1955) and the 
Leadership Opinion Questionnaire (Fleishman, 1957). 
The first instrument is an untimed, 50-item, multiple- 
choice inventory of internalized attitudes toward su- 
pervision (Nelson, 1949). This inventory yields a 
“human relations index” which appears to be some- 
what comparable to the Consideration Scale of the 
Leadership Opinion Questionnaire, discussed below. 
This dimension is referred to in this study as “con- 
sideration.” 

The Leadership Opinion Questionnaire is an un- 
timed, 40-item, multiple-choice questionnaire de- 
scribing how a person thinks he should operate in 
a leadership situation. The questionnaire has been 
factor-analyzed to yield two independent leadership 
dimensions: “Consideration” and “Initiating Struc- 
ture.” These dimensions have been described as 
follows: 


Consideration—Items with high positive load- 
ings on this factor were associated with behavior 
indicative of friendship, mutual trust, respect, and 
a certain warmth between the leader and his 
group. High negative loadings appeared on items 
which suggest that the leader. is arbitrary and im- 
personal in his relations with group members. 

Initiating Structure—Items with high positive 
loadings on this factor imply that the leader or- 
ganizes and defines the relationship between him- 
self and the members of his group. He tends to 
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define the role which he expects each member to 
assume, and endeavors to establish « well-defined 
patterns of organization, channels of communica- 
tion, and ways of getting the job done (Fleishman, 
Harris, & Burtt, 1955). 


In addition to these two instruments, each S com- 
pleted an autobiographical questionnaire which so- 
licited such data as age, education, years of employ- 
ment with the company, years of supervisory ex- 
perience with other companies and the number of 
employees supervised. 

Subjects. A total of 86 Ss participated in this in- 
vestigation, 28 being from the democratic company 
and 58 from the authoritarian organization. All of 
Ss were office supervisors of clerical and administra- 
tive personnel and were responsible primarily for 
women clerical employees. Personal characteristics 
of Ss were analyzed, and the only significant dif- 
ference found between supervisors in the two com- 
panies was that those from the authoritarian com- 
pany were responsible for a much larger group of 
employees than were supervisors from the democratic 
company.* 

RESULTS 


Company policies and supervisors’ atti- 
tudes on consideration toward employees. 
As Table 1 indicates, on neither of the two 
tests measuring a supervisor’s attitude on 
showing consideration toward his employees 
was there a significant difference between 
the democratic and authoritarian companies. 
Even though the two organizations showed 
significant differences along the democratic- 
authoritarian management continuum, their 

2Further analysis indicated that in neither com- 
pany was there a significant relation between the 
number of employees supervised and supervisors’ 
attitudes toward supervision. Differences in the two 


companies could apparently not be explained as a 
function of the size of group supervised. 


Democratic Company 


Instrument Mean SD Range 


Leadership 
Opinion 
Questionnaire 


Consideration 
Scale 


Supervisory 


Practices 29.4 4.2 23-36 
Inventory 


Erwin S. Stanton 


TABLE 1 


Supervisors’ ATTITUDES ON CONSIDERATION TOWARD EMPLOYEE 


mean scores on the Consideration Scale of 
the Leadership Opinion Questionnaire and on 
the human relations index of the Supervisory 
Practices Inventory did not differ signifi- 
cantly. There also appeared to be no sig- 
nificant difference between the standard de- 
viations of the two organizations. The data 
suggest that the attitude of supervisors in 
companies characterized by authoritarian 
policies need not necessarily be less consid- 
erate of the feelings and desires of their em- 
ployees than the attitude of supervisors in 
democratic companies. 

It was further interesting to note that in 
the authoritarian company a positive relation 
was found between length of service and su- 
pervisors’ attitudes concerning showing con- 
sideration toward their employees. This was 
indicated by correlations of .41 (significant 
at the .01 level) and .26 (significant at the 
.05 level) on the Consideration Scale of the 
Leadership Opinion Questionnaire and the 
Supervisory Practices Inventory, respectively. 
One possible explanation for this finding is 
that supervisors in the authoritarian company 
adopt with the passing of time a more be- 
nevolent and paternal attitude toward their 
subordinates. On the other hand, it is also 
possible that differences in selection stand- 
ards may account for these results. Perhaps 


3 An analysis of mean scores of supervisors in both 
companies on the Consideration Scale of the Leader- 
ship Opinion Questionnaire indicated that the scores 
were not significantly different from normative data 
reported by Fleishman, 1957. No normative data is 
as yet available for Form D of the Supervisory Prac- 
tices Inventory. 


Authoritarian Company 


Critical 
Mean SD Range Ratio 
52.6 8.1 36-70 1.13 
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Supervisors’ Attitudes Toward Supervision 


TABLE 2 


Supervisors’ ATTITUDES ON IMPOSING OPERATING STRUCTURE 


Democratic Company 


Mean 


SD 


Instrument 


Leadership 
Opinion 
Questionnaire 


Initiating 
Structure 
Scale 


* Significant at the .01 level. 


selection standards used in employing super- 
visors many years ago were different from 
those more recently in practice. It is fur- 
ther possible that still other factors yet to be 
investigated may have been operating and 
that these could explain the relation between 
longevity and supervisors’ attitudes on show- 
ing consideration toward employees. In the 
democratic company no significant correla- 
tion was found in this particular area. It 
would appear that longitudinal studies need 
to be carried out before definitive conclusions 
can be drawn. 

Company policies and supervisors’ attitudes 
on imposing operating structure. Table 2 
illustrates that there was a significant differ- 
ence between supervisors in the two com- 
panies on the Initiating Structure Scale of 
the Leadership Opinion Questionnaire. Ap- 
parently, supervisors in the authoritarian 
company felt that they themselves should 
clearly define and facilitate group interaction 
toward goal fulfillment and provide for com- 
paratively little employee participation in de- 
cision making. These supervisors believed 
that it was primarily their responsibility to 
initiate leadership functions. Supervisors in 
the democratic company, on the other hand, 
believed in imposing far less operating struc- 
ture in their dealings with employees and 
favored allowing them greater participation. 
There was no difference between the stand- 
ard deviations on this scale for the two com- 
panies investigated.* 


4A comparison of mean scores obtained by super- 
visors in the two companies on the Initiating Struc- 
ture Scale of the Leadership Opinion Questionnaire 


Range 


Authoritarian Company 


SD 


DIscuSSION AND CONCLUSIONS 


Research investigators differ greatly in 
their conclusions as to the type of manage- 
ment that should prevail in an industrial 
organization. Certain investigators have for 
some time reported on the advantages of the 
democratic approach to leadership (Bavelas 
& Lewin, 1942; Bradford & Lippit, 1945; 
Gordon, 1955; Viteles, 1953). On the other 
hand, writers critical of democratic manage- 
ment have recently expressed a preference 
for a return to a more authoritarian type of 
leadership (McMurry, 1958; Tannenbaum 
& Schmidt, 1957). According to supporters 
of democratic management, where employees 
are permitted to participate in decision mak- 
ing, morale and productivity correspondingly 
increase and many of the psychological needs 
of the individual tend to be satisfied. Critics 
of democratic management, however, point 
out that the majority of employees are neither 
able nor interested in engaging in participa- 
tive management and suggest that democratic 
leadership may even impede industrial effec- 
tiveness. These two opposing points of view 
have presented somewhat of a dilemma not 
only to research investigators, but also to 
management officials who pose the question: 
“Under what type of leadership—democratic, 
authoritarian or perhaps a compromise be- 


with normative data published would suggest that 
both organizations represented opposite extremes on 
this leadership dimension. The mean score on this 
scale for the democratic organization was signifi- 
cantly lower and the mean score for the authori- 
tarian company was significantly higher than norma- 
tive mean scores reported by Fleishman, 1957. 
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tween the two—can the organization operate 
most effectively and meet its objectives?” 

Implications of this investigation. This in- 
vestigation was concerned with studying the 
relation between policies of a democratic and 
an authoritarian company and _ supervisors’ 
attitudes toward supervision. It was found 
that in beth companies there was no signifi- 
cant difference in supervisors’ attitudes as to 
the extent of consideration that should be 
shown for the feelings and desires of em- 
ployees. There was, however, a very signifi- 
cant difference in the attitude of supervisors 
as to the type of operating structure that they 
felt should be established in their dealings 
with employees. Supervisors in the demo- 
cratic company believed that they should 
offer less operating structure and favored per- 
mitting employees a greater amount of con- 
trol over their jobs. In the authoritarian 
company, on the other hand, supervisors be- 
lieved in establishing a high degree of operat- 
ing structure in their dealings with employees 
and allowing them comparatively little con- 
trol over their jobs (Stanton, 1959). 

The data supplied by this investigation 
suggest that a company can offer an authori- 
tarian approach to leadership and still show 
consideration for the individual employee. 
Unlike the negative implication suggested by 
the treatment of the individual under a to- 
talitarian national state, this study failed to 
support the assumption on the part of some 
persons that an authoritarian approach in an 
industrial situation must, of necessity, be ac- 
companied by a lack of consideration for the 
fundamental rights of the individual em- 
ployee. Should further studies confirm these 
findings, it is quite possible that many of the 
negative connotations implied by the term 
“authoritarian” are unjustified when applied 
to industry. The results of this investigation 
would suggest that an organization may select 
a democratic, authoritarian, or a compromise 
approach to management and still be consid- 
erate of the feelings and desires of its em- 
ployees. 

The type of approach that a company’s 
management may find most effective for its 
particular organization appears to be closely 
related to situational factors. It may very 
well be that, under certain circumstances, a 


Stanton 


democratic approach to management may be 
more effective, while in another situation, an 
authoritarian approach is preferable. Quite 
possibly, in a military situation, in a crisis, 
or when an emergency arises, firm, authori- 
tarian, and directive leadership may not only 
be the most effective type of leadership, but 
also the style that is most welcome to all con- 
cerned. Another situational factor that must 
be considered in selecting a management ap- 
proach is the particular leadership style pre- 
ferred by the employees and the pattern to 
which they have become accustomed. It is 
suggested that a more democratic approach 
can be instituted only where employees are 
interested in and ready to assume responsi- 
bility for participation, identify with the goals 
and objectives of the organization, and have 
sufficient experience and knowledge to deal 
with the problems at hand. Where these 
qualifications are lacking or appear in limited 
degree, a more authoritarian or compromise 
approach may very well be most effective. 
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Recently, naval aviation technical training 
has shifted its focus from a general training 
program to a training program which intro- 
duces trainees directly into specialized in- 
struction. It is the goal of this specialized 
training to produce men who are immedi- 
ately useful upon graduation in a short “pipe- 
line” time. 

The newer program may be contrasted with 
the technical training previously given. This 
previous training was broader in nature, em- 
phasized more deeply the theoretical aspects 
of the technical skills involved in maintenance, 
and relied to a greater extent on in-service 
training for imparting the specific technical 
skills needed for specific job performance. 
The purpose of the present study was to com- 
pare in a real situation the technical effec- 
tiveness of technicians given specific training 
(transfer through identical elements) with the 
efficiency of trainees given a more general 
background knowledge (transfer through gen- 
eralization). 

Graduates of each training program for 
three naval ratings were studied: (a) jet avia- 
tion machinist’s mate, (4) air controlman, and 
(c) parachute rigger. Success as a parachute 
rigger or a machinist’s mate depends upon 
mechanical ability and perceptual motor skill, 
while success as an air controlman depends 
to a considerable degree on verbal behavior 
and abstract reasoning. 


METHOps, SUBJECTS, AND PROCEDURE 


For each rating a complete library or listing of the 
tasks that the technician could be called upon to 
perform in the fleet was developed and cast in tech- 
nical behavior check list (TBCL) form. Each check 
list contained three parts. Since it was felt that one 

1 This study was performed under Contract Nonr- 
2279(00) between Applied Psychological Services and 
the Office of Naval Research. We are indebted to D. 
Smith, John Nagay, and G. D. Mayo for their as- 
sistance throughout all phases of this work. 


of the most valid indicators of acceptable perform- 
ance is the willingness of a man’s supervisor to as- 
sign him without direct, technical supervision to vari- 
ous technical tasks, Part I determined the amount of 
time spent by the ratee on each of the tasks within 
the rating. In Part II, the amount of supervision that 
the striker required on each of the tasks he per- 
formed was requested. Part III acquired an estima- 
tion of the criticality, in terms of squadron mission, 
of each of the tasks listed. In order to study the de- 
velopmental aspects of the ratees, separate evalua- 
tions were obtained in Parts I and II for each of 
two time periods: the first three months the man 
being rated was in the fleet (T-1) and the fourth 
to ninth months he was in the fleet (T-2). 

The Ss were graduates of the two naval aviation 
training programs, the previous general “A” school 
program and the more recently established and spe- 
cialized program. For the air controlmen 39 gradu- 
ates of the more general training program (hence- 
forth referred to as ACGs) and 42 graduates of the 
specialized program (ACTs) were studied on their 
fleet jobs. 

For the parachute riggers, 10 graduates of the 
previous program (PRGs) and 23 graduates of the 
specialized training program (PRSs) were included. 
In the aviation motor machinist’s mate rating, three 
groups were involved: graduates of the previous gen- 
eral training program (ADGs; N= 21); graduates 
who had received an intermediate type of training 
which involved specialized training but which did 
not include training on the specific equipment used 
in the fleet (ADJIs; N = 60); and graduates of the 
specialized program who had received specialized 
training and practice on the specific equipment found 
in the fleet (ADJSs; N = 36). The subjects were dis- 
tributed over six naval air stations and 32 squadrons. 


DERIVATION OF CRITERION INSTRUMENT 


The mean time in hours spent during T-1 
and T-2 by each group in each rating on each 
of the listed tasks was computed. For each 
task listed in Part II of the TBCLs, a score 
ranging from one to five was assigned: a score 
of five indicated proficient task performance 
after an initial checkout; a score of one was 
assigned for tasks on which the striker had 
received six or more checkouts but on which 
he was still unable to perform without di- 
rect supervision. Scores of four, three, or two 
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were respectively assigned as follows: profi- 
cient after 3 to 5 checkouts, proficient after 
6 or more checkouts, 1 to 5 checkouts but not 
proficient. The mean and variance of the Part 
II scores for each task on each of the three 
separate TBCLs were computed by time pe- 
riod. Additionally, overall means and overall 
variances were computed for each task in 
each of the three separate TBCLs. Last, the 
criticality of each task was derived from the 
responses in Part III of the TBCLs. 

Three item plots were then made, one for 
each of the ratings under consideration. The 
average variance of each task as computed 
from the TBCL, Part II, data was plotted as 
the ordinate value, while the abscissa point 
was determined by multiplying the mean time 
spent on the task by the criticality of the 
task as computed from the TBCL, Part III, 
data. In one sense and in measurement termi- 
nology, the ordinate represented the discrimi- 
nating power of the items (tasks), while the 
abscissa value represented difficulty level. In- 
spection suggested fairly clear cutting points 
on each axis for eliminating items. Essentially, 
in developing the final criterion TBCLs, those 
tasks which were considered “unimportant” 
and which are relatively useless in a measure- 
ment instrument because of lack of discrimi- 
nation (variability) between individuals were 
eliminated. 

Thus, three separate final criterion TBCLs 
were developed, one for each of the ratings 
considered. All data henceforth reported are 
based on these final criterion TBCLs. The 
final criterion TBCL for motor machinists’ 
mates contained 38 tasks; the final criterion 
TBCLs for parachute riggers and air control- 
men contained 43 and 34 tasks respectively. 


RESULTS 


Using the same scoring method discussed 
above, Part II of each TBCL was rescored 
with the total score being equal to the sum 
of the scored tasks divided by the number of 
tasks attempted. The total scores so derived 
were then subjected to a variance’ analysis. 
The results of these analyses are presented in 
Tables 1, 2, and 3. 

These analyses enable a general answer to 
the question of whether the specialized train- 
ing had exerted any general effects on fleet 


TABLE 1 


ANALYSIS OF VARIANCE OF AVIATION MACHINISTS’ 
MATES ScORES 


Sum of Var. 

Source of Variation Squares df Est. F 
Between cells 62,947 5 — - 
Between groups 16,697 2 8,349 1.65 
Between time periods 39,737 i 
Groups X Times 6,513 2 3,256 F<1 
Within cells 905,288 178 5,086 — 
Pooled error (4+ 5) 911,801 180 5,066 
Total 968,234 183 


* Significant at the .01 level of confidence. 


technical efficiency. For the aviation machin- 
ists’ mates and parachute riggers no statisti- 
cally significant between-groups differences 
were evidenced. For the air controlmen, the 
more generally trained group was superior to 
a statistically significant extent. The between- 
time period differences noted were to be ex- 
pected since a trainee would be expected to 
perform at a superior level during his fourth 
to ninth months in the service as compared 
with his first three months. 

The next step was to determine where the 
men in the air controlman’s rating were do- 
ing well and where they needed improvement. 
The response score distributions for the five 
task content clusters included in the final cri- 
terion air controlman’s TBCL were derived. 
These distributions are presented as Table 4. 
The distributions indicated that for each of 
the five content clusters (using equipment, 
using publications, testing equipment, receiv- 
ing and transmitting messages, and controlling 
traffic), the generally trained group had higher 


TABLE 2 
ANALYSIS OF VARIANCE OF AtR CONTROLMENS’ SCORES 


Sum of Var. 

Source of Variation Squares df Est. Ff 
Between cells 87,103 3 - — 
Setween groups 40,009 1 40,009 12.52* 
Between time periods — 45,333 1 45,333 14.19* 
Groups X Times 1,761 1 1761 F<1 
Within cells 448,751 140 = 3,205 
Pooled error (4 + 5) 450,512 141 3,195 
Total 535,854 143 


* Significant at the .01 level of confidence. 
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TABLE 3 
ANALYSIS OF VARIANCE OF PARACHUTE 
RIGGERS’ SCORES 


Sum of 
Squares 


& 


Source of Variation 


Between cells 
Between groups 
Between time periods 
Groups X Times 
Within cells 

Pooled error (4 + 5) 
Total 


58,874 
7,897 
49,385 
1,592 
217,021 
218,613 


19,625 
7,897 
49,385 
1,592 
4,173 
4,125 


* Significant at .01 level of confidence. 


means over both time periods. For the first 
time period both groups were poorest at the 
tasks involved in receiving and transmitting 
messages, while in the second time period the 
generally trained group was, by and large, 
proficient at these tasks, and the specifically 


trained group required additional training. 
Moreover, even in T-2, the specifically trained 
group remained weakest in receiving and 
transmitting messages and in controlling traf- 
fic. These tasks are believed to involve mostly 
nonroutine thinking as opposed to the spe- 
cific information transfer involved in using 
publications, using equipment, and testing 
equipment. 

Similarly, an examination of Table 4 from 
the point of view of within-group strengths 
and weaknesses during T-1 suggests that if 
publication use is ignored, the specifically 
trained group was best at equipment use and 
equipment testing, while the generally trained 
group was best at testing equipment and con- 
trolling traffic. 


DIscuUSSION 


The usefulness of the present findings is de- 
pendent to a large extent on the criterion 


TABLE 4 
RESPONSE ScorE DiIsTRIBUTION FOR TOWER TASKS OF AIR CONTROLMEN 


T-1 


Response % 


Tasks 1 
Using equipment 
AC (G) 
ACT 
Using publications 
AC (G) 
ACT 
Testing equipment 
AC (G) 
ACT 
Receiving and trans- 
mitting messages 
AC (G) 
ACT 
Controlling traffic 
AC (G) 
ACT 
Totals 
AC (G) 
ACT 


Grand total 


Response % 


1 


ie, 
Var. 
1.91 
11.97* 
5 
5 
* 
= T-2 
oo » 9 22 39 24 67 3.66 1.24 6 8 5 13 68 107 4.30 1.48 a 
ian 29 13 28 25 68 3.40 1.59 15 18 8 9 50 109 3.61 2.48 4 
3 51379 67 469 48 0 0 1 3% 494 08 
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) 14 18 35 33 88 1.04 210 2 10 76 124 4.50 1.07 
» 39 21 «27 94 3.01 1.84 15 19 7 10 49 129 361 2.48 
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used. A training criterion must possess cer- 
tain characteristics if it is to be useful. These 
characteristics include relevance, comprehen- 
siveness, consistency, correctly weighted ele- 
ments, and analyzability. Taking these re- 
quirements in turn, the TBCL seems to have 
met the requisites of relevance and compre- 
hensiveness. All of the included jobs were 
drawn directly from the training course and/ 
or the actual job. Furthermore, any nontech- 
nical work characteristics were purposefully 
omitted from the TBCLs. Consequently, the 
TBCLs were relevant by definition. Since 
space was allowed for addition by the rater 
of tasks not included in the list and since 
very few “write-ins’” were made, it can be 
concluded that the technical jabs were com- 
prehensively covered. Moreover, the fact that 
the TBCLs were designed as training criteria 
rather than as general fleet performance meas- 
urements instruments points to the necessity 
of excluding variance due to performance not 
associated with what was acquired in training. 

The third characteristic, an analyzable cri- 
terion, has been amply demonstrated in the 
present study. Total TBCL scores were com- 
puted; the subsections were analyzed indi- 
vidually and combined; individual clusters of 
tasks were analyzed in terms of mean per- 
formance scores, variance, and response dis- 
tributions. All of these analyses were possible 


in relation to time and subgroup differences. : 
The correct weighting of the elements offi 


the criterion received indirect treatment in 
the present research. The item selection tech- 
nique emphasized task criticality, frequency 
of occurrence, and variance. High variance 
tasks are inherently weighted more than low 
variance tasks. Moreover, the emphasis on 
criticality tended to eliminate routine jobs 
from the list. Thus, for instance, the AD 
TBCL included only 12 tasks involving rou- 
tine line operations in comparison with 26 
tasks involving major and intermediate in- 
spections of jet aircraft and jet power plant 
removal and jet engine build-up. 

In order to estimate the reliability of the 
TBCLs, product moment correlations were 
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computed. In some cases a given striker was 
rated independently by two supervisors. In 
other cases, the same rater was asked to re- 
peat his report. These data were combined in 
the reliability determination. It can be ar- 
gued that the combined estimations are per- 
haps the best indication of total reliability 
since they consider both inter-rater and intra- 
rater reliability. The uncorrected reliability 
for the AD TBCL so determined was .84 for 
T-1 and .90 for T-2; for the AC and PR 
TBCLs the uncorrected reliabilities respec- 
tively by time period were .9I, .89, .76, and 
81. 

In terms of the issue of general versus spe- 
cific training, it seems that the specialized 
training for motor machinists and parachute 
riggers exerted few, if any, debilitating effects 
on the technical proficiency of the men in- 
volved. This contention does not seem to hold 
for the air controlmen. The differential effects 
of specialization may be explainable on the 
basis of the differences in skills involved in 
the various ratings. For the air controlmen, 
most of the tasks are of a conceptual nature, 
while for the parachute riggers and motor ma- 
chinists mechanical manipulation is largely 
involved. Moreover, within the air controlman 
rating those tasks of a conceptual nature 
seemed to be the most difficult for the spe- 
cifically trained air controlmen. 


SUMMARY AND CONCLUSIONS 


Technical Behavior Check Lists were used 
for assessing the fleet effectiveness of persons 
who had received two types of training: (1) 
specialized task oriented training, (2) general 
conceptual training. The effects of rapid, spe- 
cialized, task-oriented training in the context 
with which we were concerned varied with the 
on-the-job performance requisites. For tasks 
involving mechanical manipulation few de- 
bilitating effects were introduced by the spe- 
cialized training; for abstract conceptual tasks 
this contention does not seem to hold. 


(Received March 23, 1959) 
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THE VALIDITY OF BIOGRAPHICAL DATA ITEMS FOR 
FOOD COMPANY SALESMEN' 


THOMAS W. 


HARRELL 


Stanford University 


A consulting company has advocated the 
screening of sales applicants on their back- 
ground prior to the use of psychological tests. 
Based on experience a list of six “knock-out” 
factors has been developed. The consulting 
company has requested clients not to test an 
applicant for a selling job if one or more of 
these items is present. The list: 


1. Instability of Residence 

Regardless of the cause of changes of ad- 
dress (unless the applicant has been in the 
armed services or has been moved around by 
his employer), too frequent change of address 
points to lack of roots. 

2. Failure in Business within Two Years 

If your sales applicant failed in his own 
business, within two years prior to the time 
he applies for a sales job with you, he consti- 
tutes a risk. 

3. Divorce or Separation within Two Years 

A man who is undergoing active separation 
or divorce, or who was separated or divorced, 
even if remarried, within two years prior to 
his application, is an undue risk. 


1 This study was sponsored by the Klein Institute 
for Aptitude Testing, Inc. 


4. Excessive Personal Indebtedness 

Some men will work hard when they have 
obligations to meet.’ However, when a sales 
applicant has excessive personal indebtedness 
(not including mortgage on home) he consti- 
tutes an undue risk. Indebtedness is consid- 
ered excessive when it cannot be met within 
two years from earnings on the new job. 

5. Too High a Previous Standard of Living 

A Food Company made two items of this. 
One was previous income too high, which in 
this study meant a salary of $400 or more per 
month. The second was earnings less than 
living expenses. 

6. Unexplained Gaps in the Employment 
Record 


A study has been made of the validity of 
these background facts within the Food Com- 
pany. The Company has used the “knock- 
out” factors during the past several years to 
exclude employment to sales applicants; al- 
though, due to the tight labor market and to 
the judgment of local Sales Supervisors, many 
men have been employed who have possessed 
one or more of the background facts. The 
Food Company has redefined No. 1, Insta- 


TABLE 1 
DISTRIBUTION OF or “KNocK-ouT”’ FACTOR 


Combination 
Sales 


Knock-out Factors N % 


Retail Sales 


Fired 


Resigned 


Job-hopping 6 
Previous income too high* L 16 
Earnings less than living expense 3 
Business failure within past two years — 
Divorced within past two years 

None 


Number of cases 
“Knock-outs” per person 29 


36 33 


7 
4 


* Previously earned salary of $400 or more set as “‘too high.” 
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TABLE 2 


STANDARD ERROR OF THE DIFFERENCE BETWEEN SUCCESSFUL AND UNSUCCESSFUL SALESMEN 


ON PRESENCE OF A “KNock-OuT”’ FACTOR 


Retail Salesmen 


Successful Resigned Fired 
Level of Level of Level of 
Significance z Significance Significance 
Combination sales (successful) 1.148 3.26 1% 1.84 
Retail sales (successful) 1.88 54 
Resigned retail (unsuccessful) 1.40 — 


*® Method used was standard error of the difference between two uncorrelated proportions. Edwards, A. L. Experimental 


design in psychological research. New York: Rinehart, 1950, Pp. 76-77. 


bility of Residence, as Job-hopping. For the 
purpose of the recent study this means having 
had three or more jobs during the preceding 
five years, or two jobs in the past two years 
and obviously to be applying for another. 
The problem was whether or not there was 
any difference in the background facts be- 
tween a group of successful employees and a 
group of unsuccessful employees. The selec- 
tion of persons for each of these samples was 
initially made for another purpose based on 
whether the man had been tested on a series 
of psychological tests. The successful em- 
ployees were made up of two samples. 1. 
Twenty Retail Salesmen were chosen from 
approximately 200 employed. They were se- 
lected as having been employed for at least 
two years and having a high rating of com- 
petence by immediate Supervisor. The job of 
the Retail Salesmen is to promote the sales of 
canned goods through calls on grocery stores 
and through setting up displays. 2. Thirty- 


TABLE 3 
DISTRIBUTION OF KNock-ouT Factors 


two Combination Salesmen were studied. Com- 
bination Salesmen were promoted from Retail 
Salesmen. Combination Salesmen sell to Job- 
bers as well as calling on grocery stores for 
the same purpose as Retail Salesmen. Unsuc- 
cessful employees from the company’s point 
of view were 27 Retail Salesmen who had been 
fired and 25 who had resigned. The samples 
of Combination Salesmen and of unsuccessful 
Retail Salesmen who were fired or who re- 
signed were not taken from the group of 200 
Retail Salesmen then employed from whom 
the sample of 20 Successful Retail Salesmen 
was drawn. 

The distribution of type of “knock-out” 
factors for each of these four groups is shown 
in Table 1. There were no cases of excessive 
personal indebtedness nor unexplained gaps 
in the employment record and hence No. 4 
and No. 6 are not shown. The highest fre- 
quency, .96 “knock-outs’” per person, was 
present in the salesmen who had resigned. In 


Sales 
Knock-out 
Factors 


Successful Resigned Fired 


Retail Sales 


Three 


Total 


32 
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None 24 75 12: 60 8 32 14 52 

One 8 25 8 40 12 48 10 37 

Two 3 12 3 11 

32 20 25 27 

Arh 


Validity of Biographical Data Items 


TABLE 4 
STANDARD ERROR OF THE DIFFERENCE BETWEEN SUCCESSFUL AND UNSUCCESSFUL SALESMEN 
ON NuMBER OF “KNock-out”’ FACTORS 


Level of 
Significance 


Combination sales (successful) 1.14 
Retail sales (successful) 


Resigned retail (unsuccessful) 


order of frequency the other groups follow as 
Fired Salesmen, .59, Successful Retail Sales- 
men, .40, and Combination Salesmen, .25. 
Job-hopping is the most frequent of the fac- 
tors present. 

The significance of the differences between 
the number of successful and unsuccessful 
salesmen who had at least one “knock-out” 
factor is shown in Table 2. Salesmen who re- 
signed were different from Combination Sales- 
men at the .01 level but were not different 
from Successful Retail Salesmen. Fired Sales- 
men did not differ from Combination Sales- 
men nor Successful Retail Salesmen with re- 
spect to presence of a “knock-out” factor. 

The distribution of number of “knock-out” 
factors is shown in Table 3. No successful 
Retail or Combination Salesman had more 
than one “knock-out” factor while five of the 
Resigned Salesmen and three of the Fired 
Salesmen had two or more. Using the total 
number of “knock-out” factors shown in 
Table 1, the significance of the differences be- 
tween Successful and Unsuccessful Salesmen 
is shown in Table 4. Salesmen who resigned 
had significantly more “knock-out” factors 
than the Combination Salesmen at the .01 
level and than the Retail Salesmen at the .05 
level. Fired Salesmen had significantly more 
“knock-out” factors than Combination Sales- 
men at the .05 level, but did not have signifi- 
cantly more “knock-out” factors than the 
Retail Salesmen. 


Successful 


Retail Salesmen 


Fired 
Level of 
Significance 


Resigned 


Level of 
Significance 


SUMMARY AND CONCLUSIONS 


1. The presence of a “knock-out” factor 
was not significantly different between a sam- 
ple of Successful Retail Salesmen and each of 
two groups of Unsuccessful Retail Salesmen 
who were fired and resigned. 

2. Combination Salesmen who had been 
promoted from Retail Salesmen, were less 
likely to have “knock-out” factors than Re- 
signed Salesmen, as shown by a difference at 
the .01 level. 

3. The number of “knock-out” factors was 
greater among Resigned Salesmen than Suc- 
cessful Retail Salesmen at the .05 level. The 
number of “knock-out” factors was greater 
for Resigned Retail Salesmen than for Com- 
bination Salesmen at the .01 level. The num- 
ber of “knock-out” factors was greater for 
Fired Retail Salesmen than for Combination 
Salesmen at the .05 level. 

4. The most frequent “knock-out” factor 
was job-hopping. 

5. The results confirm the value of some 
use of the “knock-out” factors, although the 
labor market may not permit being as rigor- 
ous as the consultant recommended. There is 
an indication that the presence of two of the 
factors is a better cutoff than the presence of 
only one. The number of cases is too small to 
examine the significance of having two or 
more “knock-out” factors. 


(Received April 15, 1959) 
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This experiment simulated an organization 
in which a team of workers, whose actions 
are structured by management, performs a 
series of repetitive, coordinated, and _ inter- 
locking operations. Under such conditions, is 
the productivity of the team significantly af- 
fected by the amount of time pressure under 
which the team works? If so, does variation 
in time pressure have differential perform- 
ance effects dependent upon the complexity 
of the team’s task? In early research on per- 
formance in industrial settings, emphasis was 
placed upon job specifications, time and mo- 
tion study, and the effects of special working 
conditions. Then studies of behavior in or- 
ganizations shifted to problems of interper- 
sonal relations and group process. But now 
there is evident a revived interest in examin- 
ing the physical characteristics and effects of 
the stimulus situation. 

This “new look,” however, occurs against a 
different conceptual ground. “Working condi- 
tions” are viewed as one set of variables that, 
in interaction with personal, social, and eco- 
nomic antecedents, modify group interaction 
and in turn may affect tangible output on an 
allocated team task. In the present study a 
deliberate effort was made to control the ef- 
fects of differences in individual abilities, 
group structure, and organization in order to 
assess in a precise way the effects of the par- 
ticular situational variables of task complex- 
ity and time pressure. Information about the 


1 This research was done under Contract n6ori-17, 
T.O.111(NR 171-123) with the Office of Naval Re- 
search, but the views expressed are those cf the au- 
thors alone. This paper is a condensation of a more 
detailed technical report (Pepinsky, Pepinsky, & 
Pavlik, 1956), which may be obtained on loan from 
the Gifts and Exchange Department of Ohio State 
University Library. The authors wish to acknowl- 
edge the valuable statistical advice of D. Ransom 
Whitney. 


THE EFFECTS OF TASK COMPLEXITY AND TIME 
PRESSURE UPON TEAM PRODUCTIVITY ' 


PAULINE N. PEPINSKY ann HAROLD B. PEPINSKY 
Ohio State University 
AND WILLIAM B. PAVLIK 


Western Michigan University 


effects of task complexity and time pressure 
also is of interest in relation to diverse theo- 
retical attempts to account for individual be- 
havior associated with strong response compe- 
tition. Recurring empirical evidence indicates 
that decrement in individual performance oc- 
curs under special conditions of increased 
pressure or deprivation, where increases in 
drive level can be inferred as well (Taylor, 
1956). 

Given two team tasks, one (Task S) rela- 
tively simple, and the other (Task C) rela- 
tively complex; and three time conditions, 
designated as low (L), medium (M), or high 
(H) time pressure, the central hypothesis 
was that there would be a significant inter- 
action between the effects upon team produc- 
tivity of Tasks S and C and Time Cond. L, 
M, and H. Specifically, it was predicted that 
comparisons among the means involved in 
this interaction would show: 

(a) Team productivity on Task S to be 
significantly greater under Cond. H than un- 
der Cond. L; 

(6) Team productivity on Task S under 
Cond. M to be intermediate between produc- 
tivity under L and H; 

(c) Team productivity on Task C under 
Cond. M to be significantly greater than pro- 
ductivity under either L or H. 


METHOD 


Subjects. Seventy-two volunteer male Ss were as- 
signed to 24 three-man teams. The Ss were enrolled 
in the university introductory psychology course. 
The only other restrictions in selection were the ex- 
clusion of individuals whose obvious physical handi- 
caps would have prevented their performing an as- 
sembly task, and the requirement that each team be 
comprised of previously unacquainted persons. 

Experimental Tasks. Task complexity was estab- 
lished at two levels. Each task involved the repeated 
assembly and disassembly by a team of as many 
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replicas of a nonrepresentational Tinkertoy model as 
possible during a 20-min. session. The possible effect 
of organizational differences was controlled by the 
stipulation that all teams follow a rehearsed pro- 
cedure, performing interlocked operations in a speci- 
fied order, with the role and function of each S allo- 
cated by E. Between sessions the position of Ss were 
rotated so that each worked with a different type of 
component on each task. 

Task complexity was defined by (a) the number 
of operations required to assemble the model, (5) 
the necessity for simultaneous coordination among 
the team members to perform some operations—in 
contrast to operations performed in sequence by in- 
dividual Ss, (c) the amount of variety in the or- 
dered pattern of assembly operations, and (d) the 
number of spatial dimensions in the completed model. 
The physical arrangements and E’s procedures were 
the same for both the S and C tasks, except for ad- 
justments to correspond to the variation in com- 
plexity. 

Measure of the Dependent Variable. A team’s score 
on each task was an adjusted total of the number of 
operations performed during a work session. The ex- 
act procedure used to determine the scores is speci- 
fied in Pepinsky, Pepinsky, and Pavlik (1956, pp. 
84-85). The numbers of operations credited for dis- 
assembly of both models and for final assembly of 
the complex model were determined by the mean 
proportionate amounts of time spent by all 24 teams 
in these particular phases of the tasks. A ratio based 
on all teams was considered to be a more reliable 
weighting than one computed for each team sepa- 
rately. The only distortion that the weighting sys- 
tem could introduce seemed to be a possible overall 
reduction in the variability of the scores, with the 
concomitant risk of conservative error in the final 
conclusions. 

Procedure. For each of the experimental teams the 
procedure consisted of the same consecutive phases: 
(a) a preliminary session of supervised practice on 
the first task, (b) the first 20-min. experimental ses- 
sion, (c) a 5-min. recess, (d) a session of practice 
on the second task, (e) the second 20-min. experi- 
mental session, (f) administration of a postexperi- 
mental questionnaire, and (g) a brief decompression 
session. All sessions were held at the same time of 
day. 

For each task the team built successively a speci- 
fied number of models, which constituted roughly 
comparable work units (a set of six simple models 
or a single complex model). The team had each unit 
checked by E, and then disassembled these models 
before beginning new ones. To prevent attenuation 
of the effect of time pressure through tension release 
in casual conversation and joking, E stated that team 
members were permitted to discuss the task only. 
During the experiment, E did not interact with Ss, 
except to indicate by a nod or a gesture that com- 
pleted models were correctly assembled or should be 
corrected. 

The three time conditions were manipulated by 
varying the frequency of standard verbal signals 


given over a loudspeaker. Condition L consisted in 
one signal given at the end of 10 min., Cond. M, a 
signal every three min. up to 18 min., after which 
the stop signal followed in two minutes; Cond. H, 
signals delivered at an accelerated pace beginning 
with one every minute and increasing to one every 
15 sec. The signals consisted simply in the time- 
keeper’s saying, “ minutes to go,” or toward 
the end of the session, “ minutes,” or “ 
seconds.” 

Experimental Design. The experiment was con- 
ducted according to a factorial design, counterbal- 
anced to permit separate examination of the effects 
of the time conditions, the tasks, time condition se- 
quences, task sequences, and blocks. In one block of 
12 groups each task, placed once in first and once 
in second order, occurred in combination with each 
time condition, placed in each of the six possible se- 
quences of three conditions, taken two at a time. 
The design included two blocks or one complete rep- 
lication. All of the independent comparisons were al- 
lotted to the effects of these variables and their in- 
teractions. 


RESULTS 


Checks upon the Experimental Conditions. 
The Ss’ responses to five postexperimental 
questionnaire items tell us (a) whether they 
identified Task C as more complex than Task 
S, and (6) whether their discriminations 
among the time conditions were consistent 
with the experimental attempt to vary time 
pressure. All but four Ss identified Task C 
as the more complex. A significant proportion 
(p less than .01) liked working on Task C 
better than Task S and preferred the second 
session to the first, but Ss’ preference for one 
session over the other was not related to the 
time conditions. Regardless of either task or 
time conditions, Ss in general tended to be- 
lieve (p less than .001) that they were more 
productive during the second session. 

A significant majority of Ss (p less than 
.001) did feel under more pressure and were 
more aware of the passage of time during the 
session in which they worked under the higher 
of two time conditions (i.e., either H as op- 
posed to M, or M as opposed to L). There 
were tendencies (pf less than .10 but greater 
than .05) for Ss’ feelings of being under pres- 
sure to occur with relatively greater fre- 
quency during work on Task C and during 
the second work session. There is no evidence, 
however, that task or session alone had any 
relationship to Ss’ reported awareness of the 
passing of time. 
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TABLE 1 


MEAN Propuctivity SCORES ON Two TASKS UNDER 
TuHreE Time Conpitrions 


Time Condition 
— - Grand 


L M Mean 


615.21 
728.17 


590.05 605.68 


705.43 


671.69 


Performance during Separate Work Ses- 
sions. Table 1 presents the sets of mean scores 
most directly relevant to the hypotheses. By 
inspection, the Task C scores are consistently 
larger than the Task S scores, and there seems 
to be a curvilinear variation in productivity 
from Conditions L to M to H. The signifi- 
cance of these apparent trends was tested 
first by means of a complete analysis of vari- 
ance of single session scores (shown in Pepin- 
sky, Pepinsky, & Pavlik, 1956). This basic 
analysis did not support the ceatral hypothe- 
sis. It demonstrated unequivocally that differ- 
ences between tasks and the interaction of 
tasks and task sequence are primarily respon- 
sible for the variability in total team scores 
in separate work sessions: scores are signifi- 
cantly higher on Task C than on Task S, and 
higher in the second session than the first.* 

Performance over Time. As an alternative 
to the analysis of performance in separate 
sessions, difference scores were examined to 
determine whether changes in the produc- 
tivity of the same teams from one session to 


2 The apparent task difference cannot be attributed 
to the extraneous effect of the amount of time con- 
sumed by E in the checking operation. A ¢ test of 
the difference between the two tasks in mean opera- 
tions per second of available work time shows that 
a reliable difference in favor of the complex task re- 
mains (p less than .001). Differences among mean 
operations per second under the three time condi- 
tions are not statistically significant, although the 
mean production rates for both tasks under Condi- 
tion M appear to be slightly higher than under L or 
H. Plotted curves (not shown) of the median times 
spent by the 24 teams on assembly and disassembly 
of successive models show within-session trends con- 
sistent with the experimental predictions. But these 
tendencies occur within too restricted a range to 
have a significant effect upon total scores. 


the next reflected a curvilinear relationship to 
changes in time pressure. In order to obtain 
a basis of reference, the 24 first session scores 
were analyzed separately. This preliminary 
analysis of variance (not shown) shows that 
the main effect of tasks (with Task C pro- 
ducing larger scores) is significant at the .01 
level and substantially accounts for the vari- 
ance in these “first” measurements. The Time 
Conditions do not have a significant effect 
upon the first session scores, nor do they in- 
teract significantly with the Tasks. 

As shown in Table 2, an analysis was made 
of changes observed in the second session, 
when the same teams performed on different 
tasks under different time conditions. The 
measures employed are the 24 algebraic dif- 
ferences obtained by subtracting the Session 1 
scores from the Session 2 scores. The three 
independent variables involved here are de- 
fined as follows: 

(a) Two Changes 

(6) Three Time Condition Pairs—L s H; 
LsM; or MSH, taken without regard to 
order of presentation, so that the Class Ls H 
includes all teams that performed under L in 
one session and H in the other, regardless of 
which came first and which second. 

(c) Two Directions of Change in Time 
Condition—“higher” time condition in the 
second session than in the first, or “lower” 


in Task—S—C, or 


TABLE 2 
ANALYSIS OF VARIANCE OF DIFFERENCES BETWEEN 
PRODUCTIVITY SCORES OF SESSION 1 AND 
SEssION 2 FOR 24 TEAMS 


Mean 
Source of Variation df Square 


243,587.305 
1,112.816 


Change in task 

Time condition pair 

Direction of change in 
time condition 

CTa X DC TiC 


642.631 
2,264.230 

782.842 
9,427,995 


1,286.036 
1,305.608 


Cha XTic Px 
Error 


Total 


* Significant at .01 level. 
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time condition in the second session than in 
the first, ie., increase vs. decrease in time 
pressure. 

All of the subclass means are positive in 
sign (i.e., indicating that Session 2 scores 
show increases over Session 1) except for the 
change in task from C to S. Comparisons be- 
tween tasks consistently show a pronounced 
discrepancy in favor of C. Since Task C scores 
are higher than Task S scores, and Session 2 
scores are higher than those of Session 1, the 
difference between C and §S is increased when 
C occurs in the second session. Consequently, 
Change in Task, treated as a main effect, 
exerts an overwhelming influence in compari- 
son to either of the two Time Condition vari- 
ables, and it is clearly independent of both. 

What emerges in Table 2 as a new finding 
is the significant (p less than .01) first-order 
interaction between Time Condition Pair and 
Direction of Change in Time Condition. (Ap- 
plication of Bartlett’s test indicates that the 


variances of the six subclasses involved in this. 


interaction are homogeneous.) While Time 
Conditions have no observable effect upon the 
first session scores, changes in the second ses- 
sion scores are differentially affected by Di- 
rections of Change in Time Condition occur- 
ring in different Time Condition Pairs. Thus, 
if Condition L is followed by H, a smaller in- 
crease occurs than if L is followed by M. In 
the instance of the L<sH pair, a shift in the 
L to H direction produces a smaller increase 
than a change from H to L. These trends are 
reversed in the case of the Ls _M combina- 
tions, i.e., L to M produces a larger increase 
than M to L. The discrepancies associated 
with direction of change in the MssH pair 
have relatively small values. 

When task (a mean difference of about 160 
points) and practice (a mean difference of 
about 64 points) effects are removed, it be- 
comes apparent that adjusted score differ- 
ences ascribable to changes in time pressure 
alone are not consistently positive in sign. 
Irrespective of task, a gain in productivity 
accompanies an increase in pressure from L 
to M, but a decrement occurs when pressure 
increases from L to H; the latter change in 
performance is larger in magnitude than that 
associated with a reduction in pressure from 
H to L. Although a curvilinear relationship of 
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time pressure to team performance was pre- 
dicted for Task C only, it turns out that over 
an extended period, when changes in pressure 
upon the same teams occur, this effect charac- 
terizes their performance on both tasks. 


DISCUSSION 


The most striking finding is the predomi- 
nant effect upon team productivity of the dif- 
ference between the simple and the complex 
tasks; greater task complexity is consistently 
associated with greater productivity. An in- 
crease in task complexity does have a positive 
motivational effect through maintaining inter- 
est in the performance of repetitive opera- 
tions. This interpretation is supported by the 
statements of Ss independent of their meas- 
ured team performance. 

Although Ss reported that they felt under 
more pressure and were more aware of the 
passage of time when the time signals were 
increased in frequency, their responses to 
more or less frequent signals were a matter 
of indifference in terms of perceived reward 
value. Incentive properties are ascribed to 
time pressure conditions as they are clearly 
made to connote present or potential success 
or failure in a given situation; otherwise, in- 
terposed time signals may be little more than 
a trivial distraction. 

Despite the initial attempt to minimize the 
effects of learning, the second session scores 
still show a practice effect, which is independ- 
ent of the task and task sequence differences. 
Since the effect of practice is virtually the 
same no matter whether Task S or C is pre- 
sented first, the two tasks evidently are not 
widely discrepant in difficulty. Both appear to 
lie within what Day (1956) has called an 
“optimal zone of performance,” although they 
are discriminably different in respect to stimu- 
lus complexity. Failure to corroborate the cen- 
tral hypothesis in respect to single session 
scores may have been a function of compa- 
rability in task difficulty. But an increase in 
this task differential also would have led to 
a differential contamination of performance 
with learning on the more complex task. 

It may be that even if it is highly prestruc- 
tured, the team (as contrasted with the indi- 
vidual) work situation is indigenously so com- 
plicated that over an extended period high 
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time pressure is apt to produce a decline in 
productivity, even on an apparently quite 
simple group task. This inference is com- 
patible with the finding that differences be- 
tween sessions in productivity show a signifi- 
cant and curvilinear relationship to changes 
in time pressure, regardless of changes in task. 
In individual task performance accuracy un- 
der high pressure may vary as a function of 
initial differences in drive level. But a team 
engaged in tightly interlocked operations may 
be no more competent than its “weakest link,” 
ie., the member most predisposed toward loss 
of efficiency as pressure increases beyond a 
certain point. 

The experiment is a beginning attempt to 
deal with a complicated and important prob- 
lem. Present results underscore the importance 
of specifying both the operational and the 
phenomenal stimulus characteristics of tasks 
employed in research on behavior in groups 
and organizations and the likelihood that task 
variables may account for apparently contra- 
dictory results in otherwise similar studies. 
This study also provides tests of the effects 
upon team performance of two situational 
variables that previously have been used in 
conjunction with other procedures (cf. Lan- 
zetta, 1955; Schutz, 1955) without precise 
knowledge of their independent predictive 
value. 

The experimental findings have several prac- 
tical implications for situations whete small, 
trained teams are employed in the perform- 
ance of highly structured, repetitive, assembly 
tasks. It seems probable that such teams will 
be more motivated to increase output, if their 
assigned tasks are sufficiently complex to pro- 
vide some intrinsic interest and variety than 
if they are so simple in terms of the workers’ 
ability as to be highly monotonous and bor- 
ing. Whether the task is very simple or more 
complex, an acceleration in time pressure up 
to a moderate level can produce an accelera- 


tion in output, but an increase from low to 
high pressure is apt to have an adverse effect. 


SUMMARY 


An experiment was designed to test the 
effects of the complexity of repetitive group 
tasks and of varied conditions of time pres- 
sure upon the output of structured work 
teams. Twenty-four three-man teams each 
performed two practiced assembly tasks, one 
relatively simple, and the other relatively 
complex. Time pressure was established at 
three levels by varying the frequency of in- 
terspersed oral signals, which announced the 
time remaining in a given work session. Team 
productivity was defined as an adjusted total 
of the number of operations performed on 
each task. 

The central hypothesis that there would be 
a significant interaction between task com- 
plexity and time pressure is not supported. 
There is, however, positive evidence (a) that 
the teams were more productive when work- 
ing on a task that was sufficiently complex to 
reduce boredom, but (2) that, irrespective of 
task, when changes in time pressure occurred, 
the effect upon changes in team productivity 
was curvilinear. 
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AN INEXPENSIVE METHOD TO DETERMINE THE 
EFFICIENCY OF A TELEVISION PROGRAM’ 
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Constantly rising television costs occasioned 
by expanding program costs and an expand- 
ing list of markets into which programs are 
broadcast make the rapid determination of 
audience information increasingly desirable to 
the television advertiser. The television rat- 
ing services, while providing a considerable 
amount of audience information for the ma- 
jor markets, provide infrequent reports or 
none at all for many minor markets. Informa- 
tion relative to these smaller markets is never- 
theless important to the advertiser who is 
purchasing time on stations in these areas. 
Further, no information is provided for any 
market on a weekly basis. It would appear 
that a need exists for the development of a 
rapid, valid, and inexpensive technique to as- 
sess the commercial effectiveness of a tele- 
vision program in each or all of the markets 
into which it may be broadcast. 


PROBLEM 


A number of quantitative approaches are 
available to evaluate a sponsored television 
program. Certainly a useful criterion is sales 
resulting from a given expenditure of tele- 
vision advertising dollars, but only in un- 
usual circumstances is such information, par- 
ticularly on an individual program or market 
basis, ever available. 

A second criterion used to infer television 
program effectiveness is an estimate of audi- 
ence size. The need for, and usefulness of, 
program audience size measurements led to 
the development of the rating services. Stated 
in terms of ratings, the advertiser learns the 
percentage of households in a given area that 
is exposed to his television program and, pre- 
sumably, to his advertisement. 

A third criterion to determine the success 
of a television program is efficiency, an indi- 

1 This paper represents part of a dissertation sub- 
mitted by the senior author to Syracuse University 


in 1956 in partial fulfillment of the requirements for 
the PhD degree. 


cation of the audience obtained per dollar in- 
vested. On a national basis, it has been regu- 
lar practice for years for the advertiser, or 
his agency, having obtained an estimate of 
the size of the broadcast audience from a rat- 
ing service, to compare the total audience 
with the total cost to determine the unit cost, 
or efficiency. Although one may find excep- 
tions to the practice, efficiency hormally is 
defined as the cost in dollars to reach 1,000 
households ($/M). 

The use of efficiency circumvents problems 
associated with audience size when evaluating 
various ratings for individual markets. One 
market may contain more homes than another. 
Identical ratings in both markets will not in- 
dicate identical audiences. The relationship is 
further complicated by the factor of cost. 
Even if two markets produce audiences of 
equal magnitude, the cost to reach one audi- 
ence may be greater than the cost to reach 
the other. A satisfactory solution to this prob- 
lem is to substitute unit costs, or efficiencies, 
as these are comparable from market to 
market. 

It is recognized that once the audience tunes 
to a program many factors other than cost— 
for example, program image, mood, holding 
power, commercial identification, psychologi- 
cal appeal—become important in achieving 
maximum commercial effectiveness. Such prob- 
lems must be approached and corrected on 
an individual basis and are outside the scope 
of this article. All other things being equal, 
efficiency, as indicated by the dollar cost to 
present the commercial message in each 1,000 
homes, remains as a very useful criterion of 
television advertising success. 

If television program efficiency, as defined, 
is a useful concept, can it be determined 
market-by-market, quickly, and more eco- 
nomically than program ratings are currently 
determined. One can estimate program effi- 
ciency, involving costs and audience size, on 
a market-by-market basis as satisfactorily as 
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one can estimate audience size measurements 
alone. If enough interviewers can be employed 
coincidentally with or shortly after a program 
and the set usage data can be rapidly proc- 
essed, then the problem of speed is solved. 
Economy remains as the major problem, and 
it seems clear that a significant economy will 
be effected only by a decrease in the size of 
the sample. Smaller samples may decrease 
survey costs to the point where more frequent 
measurements in more cities may be possible. 

The purpose of this paper is to describe 
and demonstrate how the technique of se- 
quential analysis, applied to television audi- 
ence size information obtained from a random 
sample, can be utilized to produce useful esti- 
mates of the efficiency of a television program 
quickly and economically. 


METHOD 


To review briefly the theory developed by Wald 
and others (Statistical Research Group, 1945; Wald: 
1945, 1947), any sequential plan consists of examin- 
ing, in sequence, items chosen at random from a lot 
and making, with stated risks of error, one of three 
decisions: (a) that the lot is acceptable, (b) that the 
lot is unacceptable, (c) that the cumulated evidence 
is insufficient for either decision without too great a 
risk of error. If the third decision is made, an addi- 
tional item is inspected and the same three decisions 
are reconsidered on the basis of the additional in- 
formation supplied by the new item. The inspection 
normally continues until the data are sufficient to 
warrant the acceptance or rejection decision. 

In general, the two lot qualities, or two hypothe- 
ses, H: and He, between which one is trying to dis- 
tinguish, are selected so that each of them and their 
difference is practically meaningful. Likewise, the 
risks, a of rejecting Hi when H; is true and § of ac- 
cepting Hi when Hz is true, are chosen with an eye 
to the practical effects of each kind of error. 

The sequential design requires that a person set up 
certain “rules of the game” before he plays the game. 
First, he must decide upon the two hypotheses which 
state the acceptable and unacceptable efficiency limits 
with which he is primarily concerned. For example, 
an advertiser may stipulate that he hopes his pro- 
gram will achieve an efficiency of $10/M, but will 
be displeased if his program achieves an efficiency 
worse than $20/M. That is, an advertiser may very 
much desire to reach homes with his commercial 
message at a cost of one cent per home or less; but 
he will be rather unhappy if his cost per home ex- 
ceeds two cents. He is less concerned with intermedi- 
ate values. Also, the efficiency of major network eve- 
ning television programs approximates $10/M. One 
may have very good reasons for selecting other effi- 
ciencies for H; and He. For example, in a large and 
costly market an advertiser might require an effi- 
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ciency of $7.50/M and be unhappy with an efficiency 
below $15/M; while this same advertiser, in order 
to achieve as near complete saturation as possible, 
might accept $15/M in some smaller markets and be 
unhappy only if his efficiency fell below $30/M. 

In addition to selecting two meaningful efficiencies, 
one must announce two other “rules of the game.” 
First, he must decide upon the value of a, the maxi- 
mum risk or probability of rejecting an efficiency 
denoted by Hi. In the sequential plan, a rejection of 
Hi automatically implies an acceptance of He. Sec- 
ond, he must decide upon the value of 8, the maxi- 
mum risk or probability of accepting H: when H: 
is true. One must assign values on the basis of mean- 
ingful risks to the user. For example, one would nor- 
mally wish that the chances of rejecting a satisfac- 
tory efficiency would be low, say perhaps one in 
twenty. Conversely, one might not be quite as con- 
cerned about the possibility of accepting an unsatis- 
factory efficiency, and so permit a risk of perhaps 
one in ten that this type of incorrect decision might 
be made. Although a risk always exists, therefore, 
that a satisfactory television program efficiency will 
be erroneously rejected or that an unsatisfactory 
television program efficiency will be erroneously ac- 
cepted on the basis of evidence from a sample, one 
is privileged to establish the limits of risk prior to 
the survey. 

Those familiar with the use of sequential analysis 
in most inspection processes will recall that because 
the majority of items inspected will meet prescribed 
specifications one is normally concerned with the 
rapid determination of the percentage of defective 
items. However, when conducting a survey to deter- 
mine the proportion of homes exposed to a television 
program, the logical method is to record successes, 
or homes exposed to the program. 

Having established desirable and undesirable effi- 
ciencies for a certain television program as well as 
risks of making incorrect decisions on the basis of 
the sample evidence, and knowing the costs and the 
number of homes in a market, a Sequential Decision 
Table may be constructed. The table will indicate, 
for successive sample sizes, the number of homes ex- 
posed to the program—denoted by V: and Vz for 
“viewer”—associated with the two hypotheses, Hi: 
and H2; and will include a blank column upon which 
one may record and cumulate the actual number of 
home exposures determined from the interviewing. 

Sequential analysis, as a procedure to examine sam- 
ple items sequentially and to cumulate the evidence 
until a decision is reached, assumes that the charac- 
teristics of the sample, as well as the universe from 
which the sample is drawn, remain constant during 
the period of the inspection process. Because the nu- 
merical size of audience exposed to a television pro- 
gram may fluctuate during the program, the survey 
method should produce a “total audience” rather 
than an “average audience” statistic. Such data may 
be obtained by inquiring of a sample immediately 
following the program (or program segment) re- 
garding set usage during the program. At this time 
the total audience, that is, the number of homes ex- 
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posed to all or part of the program, is completely 
established, and one may with confidence proceed to 
estimate that size sequentially or otherwise. 

The telephone interview (immediate recall) method 
is suggested as a satisfactory method to determine 
total audience. If all calls concerned with a half-hour 
program are completed within 30 min. of the pro- 
gram’s conclusion, a sufficiently short length of time 
will elapse prior to a respondent being asked to re- 
call television set usage to preclude significant mem- 
ory lapse. Myers (1956) has demonstrated that, for 
television program audience size estimation, a tele- 
phone sample is sufficiently representative of all 
homes in a market in which residential telephone 
subscriptions are 80% or higher. Use of a properly 
selected telephone probability sample also satisfies 
the sequential requirement that each interview be 
drawn at random from the population and be inde- 
pendent of all other interviews. 

Using a previously selected sample of homes, a 
small team of telephone interviewers conducts calls 
from a central location, thus permitting consecutive 
accumulation and analysis of the sample data ac- 
cording to sequential requirements. As set usage data 
is obtained by each operator, an inspector keeps a 
cumulative account of the number of homes reached 
and the number of homes tuned to the program in 
question. By comparing these running totals with an 
appropriate sequential decision table, a decision can 
be made within minimum time and with a minimum 
sample. 


AN ILLUSTRATION 


To illustrate the procedure, assume that it 
is desired to assess the efficiency of the “Ford 
Theater” television program in the city of 
Syracuse, New York. The advertiser estab- 
lishes efficiencies of $10/M (desirable) and 
$20/M (undesirable), as well as risks of 
a = .05 and 8 = .10 of making incorrect de- 
cisions on the basis of the sample evidence. 
(Let us assume that, associated with these 
efficiencies for this market, the program costs 
—time charges and prorated production costs 
—are $900 and the number of homes is 
300,000.) It should be re-emphasized that 
these specifications of efficiencies and risks 
completely determine the sequential sampling 
plan and permit one to set up tables to fa- 
cilitate the collection and evaluation of sub- 
sequent survey data. Table 1 shows the Se- 
quential Decision Table conforming to the 
above specifications. The table also shows the 
results of an actual series of interviews to 
ascertain the efficiency of “Ford Theater.” 

Examination of the table shows that the 
first home interviewed saw the program, as 


TABLE 1 
SEQUENTIAL Decision TABLE FOR “ForD THEATER” 


(H; = $10/M H: = $20/M a= .05 = .10) 
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Note.—Interviewing is concluded after the 29th completed 
call, and the hypothesis, Hi, that the efficiency of ‘Ford 
Theater” was better than $10/M, is accepted. 


did the second home. The third through the 
thirteenth homes did not see the program for 
one of a number of reasons. The fourteenth, 
sixteenth, nineteenth, and twenty-first homes 
were tuned to the program. If one halts the 
interviewing momentarily and examines the 
data accumulated at the conclusion of the 
twenty-first interview, one notes that the 
number of homes found viewing the program 
(Veum = 6) is larger than Ve (1) so that one 
cannot make the decision that the efficiency 
is poorer than $20/M. At the same time, 
Veum is smaller than V; (8) so that one is 
likewise unable to make the decision that the 
efficiency is better than $10/M. Interviewing 
is therefore continued. 
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The twenty-third, twenty-seventh, and 
twenty-ninth homes are found to have viewed 
the program. At this point the cumulated 
homes that viewed the program equaled the 
V, acceptance number of 9, so the interview- 
ing was terminated with the decision that the 
efficiency of “Ford Theater” was better than 
$10/M. 

The efficiency of the program based on an 
estimate from a larger sample size of 267 was 
later computed to be $8.50/M, verifying the 
earlier decision made after a sample of only 
29 homes. Numerous examples demonstrating 
the conformity of empirical data to theory 
may be found in Myers’ dissertation (1956). 

As will be noted from the illustration, a 
unique feature of any sequential analysis plan 
is the fact that the size of the sample is un- 
known prior to the survey. Practically, it will 
be necessary to decide arbitrarily on a ter- 
mination point for the interviewing procedure. 
Although sample size is undetermined prior 
to the survey, it is possible to compute the 
sample size that will be required, on the av- 
erage, for decisions when repeated surveys are 
made with a given sequential plan. The aver- 
age number of interviews will depend upon 
the characteristics of the plan and upon the 
efficiency of the program being surveyed. 
Specifically, the smaller the risks, the greater 
the amount of inspection; also the smaller 
the difference between the two efficiencies, 
the greater the amount of inspection; and the 
greatest average amount of inspection is re- 
quired for those programs whose efficiencies 
lie between the two hypotheses. An Average 
Sample Number (ASN) curve (Statistical 
Research Group, 1945) associated with a se- 
quential plan can be drawn to show the rela- 
tionship between various program efficiencies 
and the average sample size required for each. 
The maximum ordinate pf the ASN curve— 
denoted by fi,—will indicate the program effi- 
ciency that will require the greatest average 
amount of inspection. Since the most unfa- 
vorable condition for the research in terms of 
the amount of sampling needed is given by 
the maximum ordinate, it is recommended 
that the average sample size represented by 
fi, be established as a satisfactory termina- 
tion point for the survey. 

In most instances decisions will be made 
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prior to reaching the recommended sample 
size, as was the case with the “Ford Theater” 
illustration where the decision was made at 
n = 29 when ii, was actually computed to be 
48. In the event, however, that it is impos- 
sible to reach a decision prior to interviewing 
a sample of size fi,, then the survey should be 
concluded with no decision but with the im- 
plication that the cost per thousand prob- 
ably lies somewhere between the two desig- 
nated limits. 


DISCUSSION 


Does this application of sequential analysis 
to the telephone survey method effect a sub- 
stantial savings in cost? The major cost item 
involved, once the sequential decision plan is 
prepared, is that of interviews. Using effi- 
ciency limits and risks described above, most 
television programs that are broadcast dur- 
ing the evening hours will need only two tele- 
phone operators to obtain a sample sufficiently 
large to reach a decision. If most efficiency 
decisions can be made with two operators— 
with a few programs requiring three or four 
operators at the most—the opportunity to ob- 
tain similar data frequently in many markets 
becomes economically feasible. 

The sequential model will be of primary 
concern to advertisers who use either network 
or spot campaigns to sponsor television pro- 
grams in many markets. To ascertain general 
strengths and weaknesses quickly, these ad- 
vertisers should be interested, among other 
things, in a continuous running account of 
their program efficiencies in each market; and 
they may, therefore, adopt sequential models 
that satisfy their advertising and marketing 
plans if such data is obtainable for only a 
few dollars per market. One will note that 
this method deliberately precludes any men- 
tion of program ratings with their accompany- 
ing measurement errors. Likewise, the method 
does not yield the actual efficiency obtained 
by a television program. Rather, a definitive 
efficiency concept has been employed; namely, 
that for each market, a television program’s 
efficiency is better than a designated value, or 
worse than another designated value, or prob- 
ably lies somewhere between the two. 

An advertiser thus has available an inex- 
pensive and rapid technicue for evaluating 
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the efficiency of a television program on a 
market-by-market basis. Whether he presents 
his program in 20, 50, or 150 markets, he can 
maintain weekly checks on his efficiency in 
each market at a fraction of the cost nor- 
mally required to produce individual program 
ratings. As weaknesses are exposed in various 
markets, appropriate individual analysis and 
action can then be taken to correct these de- 
ficiencies. 


SUMMARY 


This paper has presented a new concept 
and method for determining the commercial 
effectiveness of a television program. Effi- 
ciency, or cost per thousand home exposures, 
is selected as the criterion. The technique of 
sequential analysis is adapted to the tele- 
phone survey method to permit, with stated 
risks of error, a decision that the efficiency 
of a television program is better than a desig- 
nated value, or worse than another designated 


value, or probably lies somewhere between 

the two limits. The combination of small cost 

and speed with which such efficiency deci- 

sions may be made suggests the use of a se- 

quential decision plan by advertisers in many 

markets on a weekly basis as a tool in evaluat- 

ing the effectiveness of their television ad- 

vertising. 
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During recent years, the United States Em- 
ployment Service has been actively working 
toward the development of a truly functional 
job classification system. To this end, their 
researchers have analyzed 4,000 jobs, selected 
from the Dictionary of Occupational Titles, 
or DOT (1949), with respect to eight com- 
ponents or possible bases of classification. 


PROBLEM 


The present study employed the data gath- 
ered by the United States Employment Serv- 
ice for these 4,000 jobs for only one com- 
ponent, aptitudes necessary for successful job 
performance. It employed the Distance (D) 
Measure, recently discussed by Cronbach and 
Gleser (1953), as a means of classifying jobs 
into clusters, relatively homogeneous with re- 
spect to the aptitudes measured. Two general 
questions were investigated: (a) the extent 
to which the D Measure offers a consistent, 
workable technique for clustering jobs on the 
basis of aptitude data; and (5) the extent 
to which the job clusters resulting from this 
clustering technique are consistent, reason- 
able, and describable in terms of outstanding 
aptitudes or patterns of aptitudes. 

Thus, the present study was primarily ex- 
ploratory. It dealt with only one of the com- 
ponents defined by the United States Employ- 
ment Service, aptitudes necessary for success- 
ful job performance; it did not contemplate 
a complete approach to the more general 
problem of functional job classification. 


THE STATISTIC 


The D Measure was chosen for study as 


1 The study upon which this paper is based was 
submitted in partial fulfillment of the requirements 
for the Doctor of Philosophy degree at Teachers 
College, Columbia University. The author wishes 
to express his gratitude to Robert L. Thorndike, un- 
der whose guidance the study was done, for his help 
and encouragement. 
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A NEW METHOD FOR CLUSTERING JOBS’ 


DAVID B. ORR 


Teachers College, Columbia University 


44 


proach to the general problem of classification 
and to the specific problem of job classifica- 
tion. Under the assumptions of: (a) equal 
unit scales, (5) equivalence of metric from 
one aptitude to another, and (c) independ- 
ence among the aptitudes; the D Measure 
can be thought of as a generalization of the 
geometric formula for the distance between 
two points in a plane, viz., 


Diz = V (x2 — 91)? 


In the present study, these assumptions ap- 
peared to be reasonable. 

The present study dealt with several sam- 
ples of jobs, each of which had been rated as 
to the amount of each of nine aptitudes neces- 
sary for successful job performance. (The 
samples, rating scale, and aptitudes are de- 
scribed in a later section of this paper.) In 
the use of the D Measure with these data, 
each of the nine aptitudes was taken as an 
orthogonal dimension in higher dimensional 
space, and the distance concept was general- 
ized to this higher dimensional case as fol- 
lows: Let Dj; be the distance between Obser- 
vations (jobs) i and j, and let @ refer to any 
one of the m (aptitude) dimensions (a = 1, 
2,...,9); let Xj, and X;, be the score values 
(ratings) for Observations i and j, respec- 
tively, on Dimension a; then 


Dj; Nps (Xie — Xja)* 
a=! 


The basic use of this statistic in the pres- 
ent problem was literally to say that jobs 
were or were not distant from one another 
in order that they might be divided into rela- 
tively homogeneous groups which would be 
relatively distinct from one another with re- 
spect to the underlying system of aptitude di- 
mensions. This was done by considering jobs 
which exhibited small Ds among themselves 
to be similar and to be dissimilar to other 
jobs with which their Ds were large. 
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TABLE 1 


Tue Apritupes AND DEFINITIONS 


Code Aptitude 

G Intelligence: General learning ability: the ability to “catch on” or understand instructions and under- 
lying principles; ability to reason and make judgments. 

V Verbal: Ability to understand meanings of words and ideas associated with them, and to use them 
effectively ; to comprehend language and understand relationships between words and to understand 

' meanings of whole sentences and paragraphs. 

N Numerical: Ability to perform arithmetic operations quickly and accurately. 

S Spatial: Ability to comprehend forms in space and understand relationships of plane cad solid objects; 
may be used in such tasks as blueprint reading and in solving geometry problems; frequently described 
as the ability to “visualize” objects of two or three dimensions. 

P Form Perception: Ability to perceive pertinent detail in objects or in pictorial or graphic material; to 
make visual comparisons and discriminations and see slight differences in shapes and lengths. 

a) Clerical Perception: Ability to perceive pertinent detail in verbal or tabular material; to observe differ- 
ences in copy, to proofread words and numbers, and to avoid perceptual errors in arithmetic com- 
putation. 

K Motor Coordination: Ability to coordinate eyes and hands or fingers rapidly and accurately in making 
precise movements with speed; ability to make a movement response accurately and quickly. 

F Finger Dexterity: Ability to move the fingers, and manipulate small objects with the fingers, rapidly 
or accurately. 

M Manual Dexterity: Ability to move the hands easily and skillfully; ability to work with the hands in 


placing and turning motions. 


PROCEDURE 


The general procedure for the study was to take 
the aptitude data compiled by the United States Em- 
ployment Service * and apply to it a clustering pro- 
cedure, using the D Measure as suggested by Thorn- 
dike (1953). 

Description of the Data. For the purposes of this 
study, aptitudes were defined as the specific capaci- 
ties required of an individual in order to facilitate 
the learning of some task or job duty. Nine apti- 
tudes derived from the United States Employment 
Service General Aptitude Test Battery, or GATB 
(1946-47), were used as the basic dimensions of this 
study. These nine aptitudes and their definitions 
are presented in Table 1. 

The ratings of each job for each of the nine apti- 
tudes indicated how much of each aptitude each job 
required for satisfactory, not maximum, perform- 
ance. These ratings were made, according to the 
scale defined in Table 2, by six specially trained 
raters, using detailed job descriptions as their raw 
data. Each of these ratings was reviewed by other 


8 The investigator would like to express his ap- 
preciation to Sidney A. Fine, Chief, Entry Occupa- 
tions Section, Bureau of Employment Security, 
United States Department of Labor, and to the 
United States Employment Service for making these 
data available for the study. 


raters, and on the whole, no significant disagreement 
was found. The few cases of disagreement which 
did appear were resolved by the joint action of the 
rater and the reviewer. 

Tratner, Fine, and Kubis (1955) have explored the 
intraclass reliabilities of the ratings for the nine apti- 
tudes employed in the present study. These reli- 
abilities ranged from 0.79 to 0.96 with a median 
value of 0.91. Validities of the ratings were on the 
order of 0.5 to 0.6 as measured against average 
GATB scores for successful workers in a number of 
occupations. 


TABLE 2 


Tue DEFINITION OF THE RATING SCALE 


working population. 

2—An amount possessed by the highest third, exclusive 
of the top 10%. 

3—An amount possessed by the middle third of the 
working population. 

4—An amount possessed by the lowest third, exclusive 
of the lowest 10%. 

5—An amount possessed by the lowest 10% of the 
working population. 
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TABLE 3 


Custer CENTROIDS AND SAMPLE Joss FoR CLusTers FOR SAMPLE A 


Cluster Centroids 


I II 


Aptitude (N = 14) (N = 38) 


Il IV V 
(N = 35) 


(N = 10) (N = 20) (N = 23) 


Typical Jobs by Cluster 


: Meter Engineer, School Nurse, X-Ray Technician. 

: Cement Finisher Helper, Cooky Mixer Helper, Hydraulic Riveter Helper. 
: Tubular Furniture Maker, Transformer Tester, Steam Fitter. 

: Rate Analysis Clerk, Public Accountant, Composer. 

: Coal Inspector, Service Establishment Attendant, Fish Inspector. 

: Machine Driller, Stain Sander, Steam Table Attendant. 


The Samples. The original, master sample of 4,000 
jobs was chosen by the personnel of the United 
States Employment Service with regard for the rep- 
resentativeness of the sample on each items as code 
area in the DOT, variety of function, industry, and 
level of complexity. The final selection of the 4,000 
jobs in the Master Sample was made according to 
the percentages of the sample thought necessary to 
represent each of the major code areas of the DOT 
adequately. 

Two samples were used in the clustering procedure. 
These were completely independent, stratified, ran- 
dom subsamples of the Master Sample, drawn ac- 
cording to the same schedule of percentages as that 
used to select the Master Sample. These samples, 
hereafter called Samples A and B, were drawn with 
replacement and were of 140 jobs apiece. While no 
jobs appeared more than once in either sample, four 
members of Sample A were also members of Sam- 
ple B. 

Subsequently Sample C, consisting of 28 jobs, four 
chosen at random from each of the seven major 
code areas of the DOT, was drawn without replace- 
ment. 

Treatment of the Data. The first step in the pro- 
cedure was to determine an optimum number of 
clusters for the data. Then, each of the independ- 
ent samples, A and B, was clustered independently 
for this optimum number of clusters. A statistic of 
consistency was devised to indicate the similarity of 
the two independent cluster structures. Finally, the 


clusters were compared for consistency and reason- 
ableness of content. 

The graphic procedure suggested by Thorndike 
(1953) was employed to determine the optimum 
number of clusters. No definitive decision could be 
made on the basis of this procedure, but six was the 
number of clusters tentatively identified as optimum 
for these data. Therefore, the remainder of the 
study dealt with the clustering of Samples A and B 
for six clusters. 

The Clustering Process. The specific mechanisms 
of the clustering process followed Thorndike (1953) 
closely. This procedure results in clusters which are 
maximally compact over the entire structure, i.e., 
the mean distance for all observations from the 
members of their respective clusters is a minimum 
over all of the clusters. Then the mean score (rat- 
ing) of the members of a cluster on each dimension 
is computed for each of the clusters. Where there 
are m dimensions, these m means taken collectively 
for a cluster are termed the centroid of the cluster. 

The Statistic of Consistency. In order to deter- 
mine the consistency of the two independent cluster- 
ings of Sample A and Sample B, it was necessary to 
devise a statistic. It was finally decided to match 
the six clusters from Sample A with the six clusters 
from Sample B, subject to the restriction that each 
cluster must have one and only one mate in the 
other clustering. This matching was accomplished 
by representing each cluster by its centroid and com- 
paring the intercentroid distances using the D meas- 
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ure. Thus the mate for each cluster was the cluster 
in the other sample whose centroid was closest to 
its own. 

Once the clusters were matched, each of the 28 
jobs in Sample C was assigned to a cluster in both 
the Sample A clustering and the Sample B cluster- 
ing. These assignments were made by computing 
the Ds from these jobs to each cluster centroid and 
placing each job into the cluster to which it was 
closest. The statistic of consistency was the per- 
ceniage of Sample C assigned in this manner to 
matched clusters. 


RESULTS 


Presentation of the complete clusterings is 
too lengthy to include here, but Tables 3 and 
4 present the cluster centroids (mean ratings 
for each aptitude) and several “typical” jobs 
from each cluster for the optimum (& = 6) 
clusterings for Samples A and B, respectively. 
Table 5 shows the extent to which the 28 
jobs in Sample C, and also the four Sample A 
jobs which were repeated in Sample B, fell 
into matched clusters. Twenty-four of these 
32 jobs or 75% fell into matched clusters, 
and 90.6% of them fell into the closest or 
next closest to the matched cluster in both 
samples. 


The First Question 


The first question underlying this investi- 
gation concerned the consistency and work- 
ability of the clustering technique based upon 
the use of the D Measure. Several results 
apr. .° to bear on this question. The tech- 
nique is consistent and workable to some ex- 
tent because it was possible to match the 
clusters from Sample A to those derived in- 
dependently from Sample B. 

It is felt that, within the limitations im- 
posed by the design and the assumptions un- 
derlying the technique, the percentage of jobs 
falling into matched clusters was good, and 
supports the workability and consistency of 
the technique. It is possible that those jobs 
falling outside of matched clusters might in- 
dicate another cluster. However, an exami- 
nation of the jobs involved, and their ratings, 
does not seem to support this conjecture. It 
is more likely that these are jobs which fell 
at the boundaries of the present clusters and 
were moved into unmatched clusters through 
the action of the various kinds of error in- 
volved in the procedures. 

Further evidence of the consistency of the 


TABLE 4 


CLusTER CENTROIDS AND SAMPLE Jops For Six CiusTers For SAMPLE B 


Cluster Centroids 


III 


(N = 


Typical Jobs by Cluster 


: Engineering Specification Writer, Pattern and Foundry Inspector, Wood Inspector. 
: Cameraman Assistant, Cooky Mixer Helper, Pipelayer Helper. 


: Actor, Law Examiner, Parole Officer. 


: Shop Ironworker, Rock Cutter, Atomic Welder. 
: Beef Weighing Clerk, Companion, Depot Master. 
: Cable Splicer, Hand Sewer, Thermite Welder. 
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TABLE 5 


PLACEMENT OF SAMPLE C Joss IN SAMPLE 
A anp B CLustTERsS 


Clusters 


Sample Sample 
A B 


Job Title 


Bootblack 

Cheese Maker 

Chitterling Splitter 

Coremaker, Machine IT 

Diesel Engine Erector 

Drafting Clerk 

Farm Mechanic 

Fisherman, Oyster 

Fisherman, Shrimp 

Geophysicist 

Graphotype Operator 

Establishment Guide 

Hemstitching Machine 
Operator 

Ingredient Scaler 

Investigator I 

Joggle Press Operator 

Lobby Man 

Mgr., Trade-Mark, Copy- 
right 

Marker, Shipments I 

Painter, Yachts, Small 
Boats 

Photographer, Commercial 

Scalder 

Second-Ride Fare Collector 

Sheep Farmer 

Size-Maker Helper 

Soda Dispenser 

Solder Maker 

Zoologist 


Sample A Jobs Repeated in Sample B 


Molder, Finish III 
X-Ray Technician I 
Cocky Mixer Helper Il 
Reclamation Division 

Inspector vs 


Note.—Unfootnoted entries indicate that the job fell into 
matched clusters. 


® Denotes jobs falling into the next closest clusters in both 
samples. 
b Denotes any larger disagreements. 


results is the fact that the results seemed to 
be unique. During the process of determin- 
ing the optimum number of clusters, the clus- 
ter structure for Sample A for & = 3 was ob- 


tained twice, using vastly different initial clus- 
ters. Nevertheless, the redistribution steps 
of the procedure produced final cluster struc- 
tures which were identical. 


The Second Question 


The second question concerned the reason- 
ableness and describability of the clusters in 
terms of outstanding aptitudes or patterns of 
aptitudes. Evidence on this question may be 
had from an examination of the centroids pre- 
sented in Tables 3 and 4, and from the face 
validity of the content of the clusters. Ro- 
man numerals identify the clusters in the or- 
der in which they emerged; primes signify 
Sample B clusters. Matched clusters are 
combined for descriptive purposes. 


Cluster I-I'. This cluster is characterized by high 
level technical, supervisory, and mechanical jobs. 
The aptitudes required are Intelligence, Verbal, Nu- 
merical, Spatial, and Form Perception. Clerical is 
rated low. 

Cluster II-II'. This cluster is composed of ex- 
tremely low level jobs, primarily unskilled. 

Cluster III-IV'. This cluster is composed largely 
of fairly high level skilled jobs, mostly mechanical 
and artisan types. The emphasis is on Spatial, Form 
Perception, Manual Dexterity, and Finger Dexterity. 

Cluster IV-III'. The jobs in this cluster are of 
very high level with respect to their requirements of 
Intelligence and Verbal. Supervisory jobs are con- 
tained in this cluster, and there is some emphasis on 
clerical abilities. 

Cluster V-V’. The emphasis here is on clerical 
jobs and supervisory jobs at a lower level than those 
in Cluster I-I’. The aptitudes required in the great- 
est amounts are Intelligence, Verbal, Numerical, and 
Clerical. 

Cluster VI-VI'. This cluster features mechanical- 
manual jobs of a medium grade of skill. Intelli- 
gence, Finger Dexterity, and Form Perception are 
required, but no aptitude is needed in more than an 
average amount. 


Although patterns of distinguishing apti- 
tudes were not as clear as might have been 
desired, the content of the clusters seemed to 
support the hypothesis that the clusters were 
reasonable and describable in terms of pat- 
terns of aptitudes. An important finding is 


_that the chief differentiations among clusters 


appeared to be both in terms of kind and 
level. Three different kinds of clusters ap- 
peared: intellectual-supervisory, mechanical- 
manual, and clerical. Within these types, sev- 
eral clusters were differentiated from others 
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chiefly on the basis of levels of aptitudes re- 
quired. 


DIscussION 


In general, the approach explored appears 
to hold some promise for dealing with the 
classification and functional structure of vari- 
ous job domains. In assessing the practical 
value of the approach, however, several points 
should be made. : 

The procedure as here employed is highl 
impractical for any sizable amount of data 
without recourse to mechanical data process- 
ing equipment for the computation of the D 
matrix, and for the computation of the mean 
intercluster and intracluster distances in the 
reassignment procedure. 

Two limitations of the D Measure itself 
should be mentioned. First, being based 
upon a sum of squared differences, the D 
Measure takes no account whatsoever of the 
direction of those differences. Second, no D 
value is unique; it could be made up of a 
large difference between the two jobs on only 
one dimension, or of smaller differences on all 
of the dimensions, or of any state of affairs 
in between. These factors suggest that the 
D Measure may misrepresent the facts, to 
some extent, blurring the distinction between 
kind of job and level of job. 

In addition, certain other factors limit the 
interpretation of the results. First, the sam- 
ples used were small, and those small groups 
of jobs requiring specific or high level apti- 
tudes tended not to be found in the same 
proportions, if at all, in both samples. Sec- 
ond, the amount of spread in the ratings was 
small, thus almost certainly curtailing the 
discriminatory power of the technique. Fi- 
nally, it must be remembered that the cluster 
structures were the result of an averaging 
process, and thus some jobs could fit hardly 
at all in their clusters, and yet fit there better 
than elsewhere. 


SUMMARY 


Aptitude ratings made by the United States 
Employment Service of the amount of each of 
nine aptitudes necessary for success in cer- 


tain jobs were employed to explore the effec- 
tiveness of the Distance (D) Measure as a 
statistic for the clustering of jobs. Two ques- 
tions were considered: (a) the extent to which 
the D Measure offers a consistent, workable 
technique for clustering jobs on the basis of 
aptitude data; and (b) the extent to which 
the job clusters resulting from such a cluster- 
ing technique are consistent, reasonable, and 
describable in terms of outstanding aptitudes 
or patterns of aptitudes. 

Six was found to be the best number of 
clusters for these data, and two independent 
samples of 140 jobs apiece were each clus- 
tered independently for six clusters. Tables 
showing the cluster matching and cluster cen- 
troids were presented, and a statistic of con- 
sistency between the two samples was devel- 
oped and applied. The content of the clusters 
exhibited show a differentiation in terms of 
intellectual-supervisory, _mechanical-manual, 
and clerical jobs, with further differentiation 
on the basis of level of aptitudes required for 
success. 

It was tentatively concluded that the use 
of the Distance Measure, and the clustering 
technique employing it to define job struc- 
tures showed sufficient promise to warrant 
further investigation. 
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The major purpose of this experiment was 
to evaluate the effects on component move- 
ments in skilled motions of stress produced 
by prolonged sleep loss. Studies in which the 
effects of physiological stresses on human mo- 
tor performance have been investigated have 
usually employed a variety of tests which 
yield measures of overall accomplishment, 
such as number of units completed or num- 
ber of errors per trial. In the present experi- 
ment the major emphasis is on the evalua- 
tion of changes occurring in different types 
of movements within the same motion pat- 
tern. Specifically, the study was primarily 
concerned with the effects of 60 hours of 
sleeplessness on the duration of travel and 
manipulation movements in patterned mo- 
tions. 

The value of component analyses of mo- 
tion in relation to stress was suggested by an 
earlier investigation of prolonged food re- 
striction. In this study (Harris, Brozek, & 
Smith, 1958) it was shown that a 24-day pe- 
riod of maintenance on a 1,000-calorie, all- 
carbohydrate diet produced a deterioration in 
gross movements of the arms and body 
(travel). No systematic change in the finer 
manipulative movements of the fingers, wrist, 
and lower arm was evident. It is of interest 
to determine whether the differential effect on 
component movements demonstrated in the 
food restriction study is specific to nutri- 
tional deficit or is a more general reaction to 
psychophysiological stresses of various types. 

In addition to a panel-control task used in 
appraising the duration of motion compo- 
nents, the present study included four other 
tests of perceptual and motor functions. 


1 This paper is based on a thesis submitted to the 
Department of Psychology of the University of Wis- 
consin in partial fulfillment of the requirements for 
the PhD degree. The research was supported by 
funds granted by the Graduate School Research 
Committee, University of Wisconsin. 


THE EFFECTS OF SLEEP LOSS ON COMPONENT 
MOVEMENTS OF HUMAN MOTION’ 


SHELBY J. HARRIS 
Lehigh University 


METHOD 


The general plan of this experiment consisted of 
the administration of several special tests of percep- 
tual and motor iunctions in a 5-day training period, 
in a 3-day sleep loss period, and in a 2-day recovery 
period. 

Prior to the sleep deprivation period each S was 
given five consecutive days of practice on all of the 
tests used in the study. Data were recorded only on 
the last two days of the training period. In order to 
evaluate diurnal variations in performance during 
the three-day period of sleeplessness, tests were ad- 
ministered in the morning (5-8 a.M.), about noon 
(11 aM. to 1 P.m.), in the early evening (5-8 P.m.), 
and midnight (11 p.m. to 1 a.m.). All tests, with the 
exception of the panel-control task, were used in 
each test session. To lessen the risk of Ss developing 
finger blisters, the panel task was included only’ in 
the morning and evening sessions. 

All of the tests were administered again once on 
each of the two days immediately following the 
sleep loss period. With two exceptions the test ses- 
sions took place in the afternoon, usually between 3 
and 6 p.m. In two cases the tests were administered 
in the latter part of the morning on the first recov- 
ery day. 

The Ss were closely supervised during both nights 
and the last two days of the sleep loss period, ex- 
cept at mealtimes when they were allowed to return 
to their living quarters for an hour or less without 
supervision. On the second day of the sleep loss pe- 
riod, some Ss were also permitted to leave the labo- 
ratory in order to attend classes, perform meal jobs, 
carry out necessary activities at other labs and, on 
one occasion, to attend a football game. The Ss typi- 
cally spent their time in rather light activities such 
as studying, playing cards, throwing darts, taking 
short walks, etc. 

The Ss were allowed to sleep as long as they de- 
sired on the nights preceding and following the sleep- 
less period. On the night immediately preceding sleep 
loss, Ss slept an average of about 7.5 hr. Immedi- 
ately following sleep loss an average of 12.3 hr. of 
sleep was taken. 

No independent control group was used in the 
study. It was decided to use the data from the first 
sleep loss day and the recovery period as a baseline 
to evaluate changes accompanying sleep loss. This 
decision to use Ss as their own controls was based 
on several considerations. Previous data on the panel- 
control test, which was the task of primary interest 
in the study, indicated that the amount of training 
given during the training period should be adequate 
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to lead to asymptotic performance on both motion 
components (Harris et al., 1958). Further, the prac- 
tical difficulties in obtaining voluntary Ss to submit 
to the rigors of the experiment and the unfeasibility 
of testing Ss at 6-hr. intervals without interfering 
with their sleep schedules dictated against the use 
of an independent control group. 

Subjects. Nineteen male students at the University 
of Wisconsin served as Ss. Eight Ss were graduate 
students and 11 were undergraduates enrolled in psy- 
chology courses. The Ss ranged in age from 19 to 
26 yr. The Ss participated in groups of three and, 
on one occasion, four individuals. 


Tests and Test Procedures 


Panel-Control Test. The apparatus and testing pro- 
cedure used for assessing the duration of travel and 
manipulation movements in a motion pattern has 
been previously described (Smith. & Wehrkamp, 
1951). Briefly, S’s task consisted of turning a series 
of 28 rotary switches mounted on a control panel. 
Separate measurement was obtained of the time in- 
volved in turning (manipulation) and in moving be- 
tween switches (travel). Two conditions of perform- 
ance were investigated: a 10-min. period of con- 
tinuous performance at a “normal” or “comfortable” 
pace and a 2-min. period of continuous performance 
at maximum pace. These conditions were the same 
as those employed in the food restriction study 
(Harris et al., 1958). 

Bimanual Coordination. This test consisted of four 
columns of five two-position, toggle-type switches. 
In bimanual performance S worked back and forth 
with the right hand between the two columns of 
switches on the right and, at the same time, with 
the left hand.between the two columns of switches 
on the left. The distance between the outside and 
inside columns was 18.5 in. Total time involved in 
flipping the 20 switches was automatically regis- 
tered on an electric timer. 

Two bimanual patterns of travel motion were 
used. These were “simultaneous” and “mirrored” mo- 
tions. In the simultaneous motion patterns, the two 
arms moved between the columns of switches in the 
same direction at the same time. In mirrored mo- 
tions, the two arms moved simultaneously in op- 
posite directions between the columns of switches. 
Time-scores for five subtests of each type of motion 
pattern were summed and treated as a single score. 

Two tests of unimanual performance were also in- 
cluded. The task was identical except that the two 
hands were used singly. One test involved perform- 
ance with the right hand and the other, perform- 
ance with the left hand. 

Instructions were given to work as rapidly as pos- 
sible without making any errors. Repetition of a 
subtest was required if an error was made. 

Leg Movement. The apparatus used in measuring 
the speed of a discrete leg movement was patterned 
after that developed by Brozek, Simonson, and Keys 
(1952). The S’s task was to throw his right leg 
straight up as far and as fast as possible. The dura- 
tion of the movement from the instant the foot 


leaves the floor to the point where the thigh is hori- 
zontal was recorded. Three trials were given at each 
test session. 

Steadiness. In this test S’s task was to hold a rod, 
} in. in diameter, as steady as possible with his right 
hand while resting his elbow on a table. Lateral de- 
viations of the rod illuminated one of two lights in- 
dicating the direction of S’s deviation. The number 
and duration of deviations from the center position 
were recorded. Three 30-sec. trials were given at 
each test session. 

Critical Flicker Frequency. A circular test patch 
# in. in diameter was viewed at a distance of 7 in. 
The brightness level of uninterrupted light beam 
was maintained constant at about 5.5 mL. The light- 
dark ratio was one, and monocular vision was used 
throughout. Unlimited time was allowed for S to 
make the adjustment to the point of fusion. Eight 
trials were given at each test session with alternate 
trials starting at flicker rates of varying amounts 
above and below fusion. 

Order of Tests. The following order of adminis- 
tration of the tests was used in each session: (a) 
steadiness, (b) CFF, (c) bimanual coordination, (d) 
leg movement, and (e) panel-control test. 


RESULTS 


Figure 1 shows the mean performance level 
for the manipulation and travel movements 
under the normal-pace condition. The move- 
ment durations are shown for each of the test 
sessions during sleep loss (SL), the last two 
training days (T) and the two recovery days 
(R). Thus, the figure pictures the general 
level of performance for each of the 10-min. 
work periods without consideration of changes 
in performance within work periods. Duncan’s 
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Fic. 1. Mean time per trial for the manipulation 
and travel movements under normal pacing at each 
of the test sessions during training (T), sleep loss 
(SL) and recovery (R) period. Any means which do 
not have a letter in common differ significantly. 
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Fic. 2. Mean time per trial for manipulation and 
travel movements under maximum pacing at each 
of the test sessions. Means which do not have a 
letter in common differ significantly. 


multiple-comparisons test (1951) was applied 


to the manipulation and to the travel time 
data.? The results of the Duncan test are also 
shown in Fig. 1. Letters are used to indicate 
groups of means which do not differ signifi- 
cantly. Any pair or group of means which do 
not have a letter in common are significantly 
different. 

Examination of the results of the Duncan 
test shows that the decrease in manipulation 
times during sleep loss is significant. The ma- 
nipulation values do not return to the level of 
control values on the two recovery days. 

The results of the Duncan test on the 
means of the travel movements demonstrate 
that the increase in travel times during sleep 
loss is not significant. Only the morning ses- 
sion on SL-3 exceeds either of the two SL-1 
control values and, there, it exceeds only the 
evening test session. Thus, only when the ef- 
fects of sleep deprivation are considered in 
conjunction with diurnal effects do any of 
the differences attain statistical significance. 

The analogous curves for the maximum- 
pace condition are shown in Fig. 2. The mean 
time per trial for the manipulation and travel 


2In all the analysis of variance and Duncan tests, 
significance at the .01 level was required before sig- 
nificance at the .05 level was accepted. This more 
conservative requirement seemed warranted because 
of the possible departure from normality of some of 
the data. Summaries of analyses of variance may be 
found in the thesis (Harris, 1955) upon which this 
paper is based. 
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movements during the 2-min. work period is 
plotted for each of the test sessions. In con- 
trast to the results on normal pace, the pro- 
nounced diurnal effects on travel movements 
under maximum pacing become apparent only 
after the loss of one night of sleep. Further, 
the mean travel times in the evening sessions 
continue to increase throughout the three days 
of sleeplessness. The travel times on the two 
recovery days are below the values on the 
first sleep loss day. 

The results of the Duncan test indicate that 
the general decline in manipulation times over 
the sleep loss and recovery periods is signifi- 
cant. The duration of the travel movements 
are significantly greater in the morning test 
sessions of the second and third days of sleep 
loss. While the travel times in the evening of 
the second and third sleepless days are not 
significantly greater than on SL-1, they are 
significantly greater than on either of the re- 
covery days. 

In order to evaluate changes in movement 
times within work periods, as contrasted to 
changes between work periods, the medians 
of the first three, middle three, and last three 
trials of each work period were selected for 
analysis. These medians were computed, sepa- 
rately, for the two motion components on the 
two conditions of pacing. Only the data from 
the test sessions during the sleep loss and re- 
covery periods were included in the analyses. 
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Fic. 3. Mean time per trial for the medians of the 
first three, middle three, and last three trials of each 
work period. Means shown are for the pooled data 
of the sleep loss and recovery periods. 
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Sleep Loss Effect on Human Motion 


In no instance was the Trials X Test ses- 
sion interaction significant. This lack of sig- 
nificance indicates that changes in component 
movement times within work periods (i.e., 
work decrcments) were not consistently al- 
tered during the course of the sleep loss and 
recovery periods. 

Figure 3 shows the changes in component 
movement times within a work period under 
the two conditions of pacing. The data for 
the sleep loss and recovery periods have been 
pooled to obtain the means shown in the fig- 
ure. Under normal pacing, neither of the mo- 
tion components shows a significant change 
over trials. On the other hand, both manipula- 
tion and travel times increase significantly 
during the 2-min. work periods at maximum 
pace. 

Bimanual Coordination. Figure 4 shows the 
mean total time for 16 Ss* on all 10 of the 
subtests of bimanual performance. The data 
for the mirrored and simultaneous motion pat- 
terns have been pooled to obtain the means 
shown in the figure. This pooling of the data 
is justified by the lack of significant differ- 
ential effects of sleep loss on the two types of 
motion patterns. Mirrored motions were su- 
perior to simultaneous motions throughout the 
experiment. 

Comparison of the mean performances in 
morning and evening sessions shows that the 
diurnal changes in performance are signifi- 
cant. With respect to the changes in perform- 
ance attributable to sleep loss, only the morn- 
ing session of SL-3 shows a significant de- 
terioration when SL-1 is used as the basis of 
comparison. However, performance on both 
recovery days is significantly superior to that 
of all test sessions of SL-3. 

The mean total times to perform the two 
unimanual tests are also shown in Fig. 4. The 
rationale for pooling the data from the two 
hands is the same as that for combining the 
data of the two motion patterns in bimanual 
performance, i.e., there is no significant dif- 
ferential effect of sleeplessness on perform- 
ance by the two hands. As may be seen in 
Fig. 4, the changes in unimanual performance 


3 Due to a change in the procedure in administer- 
ing and scoring the test of bimanual and unimanual 
coordination, the first group of three Ss was not in- 
cluded in the analysis of the data. 
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Fic. 4. Mean total time for the 10 subtests of bi- 
manual and for the two subtests of unimanual per- 
formance. Means which do not have a letter in com- 
mon differ significantly. 


are very similar to those of bimanual per- 
formance. 

Leg Movement.* The median duration of 
the three leg movements performed by each 
S at each test session were used in the analy- 
sis of the data. The mean of these median 
scores was then computed. The results of the 
Duncan test indicated that all three means 
on SL-3 were significantly above the SL-1 
values. These same means were not signifi- 
cantly higher than the mean on R-1 but were 
reliably greater than on R-2. The decrease in 
speed of a simple leg movement occurred en- 
tirely during the second night of sleep loss. 

Steadiness. The results on the test of hand 
steadiness were evaluated both in terms of 
number of “contacts” (deviations from the 
center position) and mean duration per con- 
tact. No reliable change in mean duration of 
contacts over test sessions of the experiment 
was demonstrated in analysis of variance of 
this data. 

In the analysis of the number of contacts, 
test sessions proved to be a significant source 
of variation. However, the change in number 
of contacts as a function of sleep loss was 
irregular. Only the means of the midnight ses- 
sion on SL-2 were significantly below the con- 
trol values. 


4For the sake of brevity, only a brief description 
of the results on the leg movement, steadiness, and 
CFF tasks is presented here. A more detailed ac- 
count is available elsewhere (Harris, 1955). 
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Critical Flicker Frequency. The median 
flicker fusion frequencies of the last three 
trials starting below fusion and the last three 
trials starting above fusion were used in the 
analysis of the data. Inasmuch as the fusion 
frequencies from initial settings above and 
below fusion did not differ significantly, the 
data from these two sources were combined. 
The results of the Duncan test demonstrated 
that the overall decline in CFF during the 
60-hr. sleep loss was significant. However, the 
significance of the decrease produced by the 
loss of one night of sleep was somewhat am- 
biguous. While all of the values of SL-2 were 
significantly below all means on SL-1, they 
were not significantly below the value on R-1. 


DIscuSSION 


The absence of an independent control 
group and the failure of the performances 
during the recovery period to return to the 
presleep loss levels make the interpretation 
of some of the data somewhat equivocal. In 
the CFF, steadiness and leg movement tasks 
and in the travel movements under normal 
pacing, the level of performance is essentially 
the same on SL-1 and during recovery. How- 
ever, the bimanual coordination test, both 
motion components on the panel-control test 
under maximum pacing and the manipulation 
movements under normal pacing show a su- 
perior performance during the recovery pe- 
riod. 

Within the design of the present experi- 
ment, the possibility that on some tasks detri- 
mental effects of sleep loss have been ob- 
scured or diminished by continued learning 
during the sleepless period cannot be pre- 
cluded. In the case of the bimanual coordi- 
nation test, where no previous learning data 
is available, there is no objective evidence 
against this possibility. However, on the panel- 
control task, if the results are considered in 
relation to earlier studies, it seems unlikely 
that practice alone is adequate to account for 
the improved performance on R-1 and R-2. 
Previous studies (Harris et al., 1958; Smith 
& Smith, 1955) indicate that the amount of 
training given prior to the sleepless period 
should have been enough to lead to asymp- 
totic or near-asymptotic performance on both 
motion components. 


It seems more likely that the improved per- 
formance is largely attributable to compen- 
satory motivation accompanying sleeplessness. 
While no systematic ratings of “effort ex- 
pended” were obtained, general observations 
of S’s test behavior and comments strongly 
suggested that the tests became more chal- 
lenging as sleeplessness progressed. If the 
higher standards of performance established 
late in the sleepless period carried over to 
the recovery period, this would account for 
the superior performance after sleep loss. It 
should be noted that improvement in per- 
formance during and after sleep loss occurs, 
primarily, in those tests in which self-pacing 
is an important aspect of the task. This would 
be expected if compensatory motivation were 
an important factor. 

A differential effect on manipulation and 
travel movements produced by a_psycho- 
physiological stress seems to be clearly dem- 
onstrated in the panel-control task. This re- 
sult is in agreement with the earlier study in 
which the same task was used to appraise per- 
formance changes accompanying food restric- 
tion. In both instances the deleterious effects 
of the stress condition were confined to the 
travel movements. 

The data on travel movements in the panel- 
control task and the unimanual and bimanual 
coordination tasks, in which travel movements 
predominate, show consistent diurnal varia- 
tions during the three-day sleep loss period. 
These observations, in conjunction with those 
noted above, suggest that variations in “physi- 
cal fitness” produced by psychophysiological 
variations are reflected primarily in repetitive 
travel movements rather than in other types 
of work movements. Such movements seem to 
be sensitive indicators of normal psycho- 
physiological variations as well as of the more 
extreme stress conditions produced by sleep 
loss and nutritional deficit. 


SUMMARY 


In this study the effects of 60 hr. of sleep- 
lessness on various psychomotor and percep- 
tual performances have been investigated. The 
major purpose of the experiment was to de- 
termine whether sleep loss produces differ- 
ential effects on component movements of 


| 
| 
| 
| 
| 
ty 
| 
# 
| 
pag 
| 
sh by 
| 
Wa 
| 
| 
| 


Sleep Less Effect on Human Motion 55 


motion. The results may be summarized as 
follows: 

1. The loss of sleep had ‘a differential effect 
on manipulation and travel movements in a 
panel-control task. The duration of manipu- 
lation movements tended to decrease during 
sleep loss. Travel movements, particularly un- 
der a maximum-pace condition, tended to in- 
crease in duration during the sleep loss period. 

2. Speed of .performance in a test of bi- 
manual and unimanual coordination gave re- 
sults similar to those obtained for travel 
movements in the panel-control task. 

3. Speed of a discrete leg movement de- 
creased during the sleepless period. 

4. The number of contacts in a test of 
hand steadiness showed an irregular change 
during the sleep loss period. No significant 
change in the duration of contacts was ob- 
tained. 

5. A significant decrease in CFF was found 
during sleeplessness. 

6. The data on travel movements in the 


panel-control task and the unimanual and bi- 
manual coordination tasks showed consistent 
diurnal variations during the three-day sleep 
loss period. 


REFERENCES 


Brozek, J., Srwonson, E., & Keys, A. A test of 
speed of leg and arm movements. J. appl. physiol., 
1952, 4, 753-760. 

Duncan, D. B. A significance test for differences be- 
tween ranked treatments in an analysis of vari- 
ance. Va. J. Sci., 1951, 2, 171-189. 

Hares, S. J. The effects of sleep loss on component 
movements in human motion. Unpublished doc- 
toral dissertation, Univer. of Wis., 1955. 

Harnis, S. J., Brozex, J., & Smitn, K. U. The ef- 
fects of caloric restriction on the travel and ma- 
nipulation components of human motion. Jnternat. 
A. angew. Physiol. einschl. Arbeitsphysiol., 1958, 
17, 34-41. 

Patricia A., & Smit, K. U. Effects of sus- 
tained performance on human motion. Percept. 
mot. Skills, 1955, 5, 23-29. 

Smita, K. U., & Wenrxamp, R. A universal motion 
analyzer applied to psychomotor performance. 
Science, 1951, 113, 242-244. 


(Received May 11, 1959) 


4 
i 
Ne 
rey 
| 
| 
a 
ay 
| 
. 
| 
Ne 
4 
pee 
> 


Journal of Applied Psychology 
Vol. 44, No. 1, 1960 


UNION AND MANAGEMENT TRAINEES—A COMPARATIVE 


Considerable research has been carried out 
in the area of investigating whether occupa- 
tional personalities exist and whether certain 
kinds of people tend to enter specific occupa- 
tions. Though studies have frequently failed 
to support a personality theory of occupa- 
tional choice, such negative findings appear 
not completely to disprove the theory, but 
seem, in many instances, to have been the 
result of inappropriate instruments, loosely 
defined occupations, and failure to precede 
the selection of tests by adequate study of the 
occupation, with an emphasis on the person- 
ality demands of the job. 

During recent years the literature has re- 
vealed an increasing concern by corporation 
executives with the problem of management 
succession and with appraisal of candidates 
for advancement. Concern with executive re- 
placements has also been voiced by a group 
of labor leaders who have noted the trend to- 
ward professionalization in management and 
in their own unions. 

In an attempt to study further the relation- 
ship of personality to occupational entry, the 
present investigation undertook to determine 
whether individuals considered by top union 
and management executives to be worthy ma- 
terial for training for leadership positions were 
different from each other in psychological 
areas. 


1 This study is based upon a dissertation submitted 
to Teachers College, Columbia University in partial 
fulfillment of the requirements for the PhD degree. 

2 The author would like to express his apprecia- 
tion to the members of his committee: Donald E. 
Super, Henry David, Joel R. Davitz, and Albert S. 
Thompson. Acknowledgment is also made to the 
International Ladies Garment Workers Union and 
the Institute of Management and Labor Relations of 
Rutgers University for financial support. Gratitude 
is similarly due to the Grace Line, Inc., and again 
to the ILGWU for making their management trainees 
available for study. 


STUDY OF PERSONALITY AND OCCUPATIONAL CHOICE’ 


HOWARD M. BOGARD 2 


Teachers College, Columbia University 


BACKGROUND AND PROCEDURE 


In 1950, a formal leadership training institute was 
inaugurated by the International Ladies Garment 
Workers Union (ILGWU). Students accepted for 
the one year course of training and who are hope- 
fully considered to be the future leaders of the union 
are, upon successful completion of their studies, guar- 
anteed placement within the union in positions of 
responsibility, depending upon their specific skills and 
union needs at the time. The 12-month training 
course is divided between full-time class attend- 
ance and field work under supervision of incumbent 
officers. 

Though carefully defined job specifications have 
not been constructed, the admissions committee has 
set forth general entrance requirements. At the pres- 
ent time candidates are generally expected to be, 
most of all, “dedicated to trade union ideologies.” 
Those accepted are expected to have a “strong so- 
cial feeling,” to exhibit a willingness to remain rela- 
tively detached from close family ties, and to aver 
acceptance of the fact that their earnings will rarely, 
if ever, compensate them adequately for the hours 
that they will be required to work. Intelligence, level 
of formal education, and previous employment his- 
tory are usually not considered too important by 
the union’s admissions committee. 

Job history, age, education, and military service 
are not verified, and the desired personality attributes 
are sought through a few rather brief though inten- 
sive screening interviews, and one appearance before 
the executive board of the union. 

Nineteen male students in the ILGWT! class gradu- 
ated on May 30, 1957 and 21 students in the union 
class who completed their training in May of 1958 
were selected to comprise the experimental group. 

The management group chosen, in order to be con- 
sidered comparable, had most of all to be comprised 
of trainees motivated to eventual leadership. As con- 
trols, it was desired that they be undergoing training 
for leadership, should be in an early stage of their 
job careers and should be as close as possible to the 
union group’s age level. Intelligence and level of 
formal education, though of some relevance, were 
considered an important variable only if the groups 
investigated were radically different. 

At its headquarters in New York City, the Grace 
Line, Inc., a major international shipping corpora- 
tion, conducts a systematic and well administered 
management training program. They recruit widely 
for candidates and the reported acceptance ratio of 
applicants is low. 

Though exceptions are made, superior intelligence, 
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high energy level, strong aggressiveness, and per- 
sonal maturity are generally regarded to be the most 
desired characteristics in applicants. Age and marital 
status are not considered to be of great importance. 
Candidates are briefly screened by the personnel de- 
partment, and those favorably considered are given 
a brief personality test. For those who successfully 
pass this hurdle, a full day of interviews with eight 
or nine officers of the corporation is then scheduled, 
and the decision of whether to accept or reject an 
applicant is made only after he has completed all of 
these steps. 

Forty males employed by the Grace Line com- 
prised the control group. Of these, sixteen were cur- 
rently enrolled in the training program, eighteen 
were recent trainees and six were new executive em- 
ployees. 


Choice of Study Instruments 


Segal (1953), in discussing techniques of 
studying occupations, emphasized the need to 
augment traditional job analysis methods by 
stressing the personality demands of the job. 
He suggested that the more relevant factors 
would then be exposed and made available 
for study. 

With such an orientation in mind, and 
based in part on the “ideal types’ sought by 
the union and management training program 
administrators and on material found in pub- 
lished biographies of union and management 
leaders, a “typical” trade union and manage- 
ment leader was drawn. Then, after propos- 
ing that specific kinds of early life experi- 
ences were significant in determining pres- 
ently revealed personality traits, four testable 
hypotheses built upon the “typical” trade un- 
ion and management leaders and on the noted 
approach to personality formation were for- 
mulated. 

It was assumed that differences would be 
most apparent in the areas of: 


. Social class identification 

. Values 

. Personality characteristics 

. Biographical and developmental experi- 
ences 


Due to Ss’ time-consuming instructional 
program, individual interviewing was not pos- 
sible and group administration of all tests 
was mandatory. After considering the re- 
quired administration time and the vocabu- 
lary of published tests which seemed particu- 


larly well suited to investigating the above 
four hypotheses, the Sims SCI Occupational 
Rating Scale, the Allport-Vernon-Lindzey 
Study of Values, and the California Psycho- 
logical Inventory were selected to study the 
first three areas. 

As it was the intention of the study to in- 
vestigate and describe in some detail “what 
kind of people” chose to enter careers in trade 
unions and management, an instrument which 
provided details of prior life experiences hy- 
pothesized to be related to specific person- 
ality trait development appeared useful. 

An analysis, within a psychoanalytic frame- 
work, of the trade union and management 
“ideal types” enabled the postulation of cer- 
tain early life experiences and family constel- 
lations which seemed basic to present behav- 
ior. Those were then refined into the follow- 
ing five variables: * 


1. Relationships with authority figures 

2. Maternal control of the family 

3. Age and severity of independence train- 
ing 

4. Success in peer group relationships 

5. Social conformity of the family 


Items pertaining to these variables were 
constructed and then submitted to a group of 
judges for screening in order to increase the 
face validity. Employing the scores from both 
the union and management groups, odd-even 
reliability coefficients of the individual cate- 
gories were computed and corrected by the 
Spearman-Brown formula, with all traits scor- 
ing sufficiently high to warrant confidence in 


8 Items pertaining to the first four variables were 
directed to surveying such areas as: (1) the kinds of 
experiences the respondents had with their parents, 
teachers, employers, etc.; (2) which of their parents 
exerted the most significant control over the family; 
(3) the age at which the respondent was first given 
certain responsibilities, was permitted to have dates, 
seek part-time employment, etc.; (4) the status held 
by the respondents in their play groups, clubs, teams, 
and formal organizations. 

Social conformity, (5), was appraised by first tabu- 
lating various aspects of behavior, as recalled by Ss, 
of their immediate neighbors during childhood and 
adolescence and then later in the inventory, present- 
ing the selected items in a different order and hav- 
ing Ss indicate what their family did in these situa- 
tions. The discrepancy score, between the behavior 
of neighbors and the respondents’ families, was em- 
ployed as the indicator of Social conformity. 
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TABLE 1 


EDUCATION OF UNION AND MANAGEMENT TRAINEES 
AND THEIR FATHERS 


Trainees Fathers 


Grade 
Completed Trainees Fathers 


4 years college 8 0 38 13 
3 years college 3 0 2 1 
2 years college 8 3 0 3 
i year college 5 0 0 2 
High school 

graduate 11 9 0 10 
Less 5 28 0 11 


the instrument for group use.* This custom- 
built Biographical Inventory comprised the 
fourth study instrument. 

The average union age of 24.83, with a 
range of 20-40 years, when compared to the 
management mean of 27.75, with a range of 
22-40 years, is so close as to suggest that any 
intergroup differences on the tests could not 
be assumed to be a function of an age dis- 
crepancy. 

As the Grace Line places a premium on 
educational attainment, it is not surprising to 
note in Table 1 that the management group 
generally appears to have completed consid- 
erably more formal education than the un- 
ion trainees. Though the administrators of 
the ILGWU training program suggest that 
amount of formal education is relatively un- 
important, 60% of the union trainees state 
that they completed at least one year of col- 
lege training. Almost half of the ILGWU stu- 
dents allege that they had completed two or 
more years of college and many suggest that 
financial need was the main reason for their 
not continuing their formal education. Fur- 
thermore, due to the professional approach of 
the institute staff and the strong academic 
content of their training, the possibility that 
some of the ILGWU students considered their 
attendance as a form of college or professional 
schooling should also be considered. 

Two factors should be noted in considering 


4 Authority relationships .90 
Material control 62 
Independence training 58 
Peer group relationships 83 


Social conformity 
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TABLE 2 


Marita Status OF UNION AND MANAGEMENT 
TRAINEES 


Group Single 


Married Total 
Union 26 14 40 
Management 18 22 40 


the greater number of married Grace Line 
trainees (Table 2). A number of the manage- 
ment group had been married for only a very 
few years and as their mean age is almost 
three years greater than that of the ILGWU 
group, the psychological meaning of the dif- 
ference in marital status may not be as strik- 
ing as it appears. During the selection in- 
terviews, the ILGWU screening committee 
stresses a willingness of the candidates to re- 
main relatively detached from close family 
ties. Those who were married and were fa- 
thers were apprised of the long and irregular 
hours that they would be called upon to work 
and particular emphasis was laid upon the 
fact that they could, upon completion of their 
training, be assigned to any area of the United 
States. Additionally, the applicants were pro- 
vided with details of vigorous antilabor ac- 
tivities and tales of physical violence which 
had occurred in some of the communities to 
which they could be assigned. 

Discussions pertaining to why certain peo- 
ple select union rather than management ca- 
reers frequently raise the question of whether 
“their fathers were union members’? Table 3 
indicates that 80% of the fathers of the 
ILGWU trainees were reported as having held 
membership in a trade union as opposed to 
only 32% of the fathers of the Grace Line 
trainees. 


TABLE 3 


Union MEMBERSHIP OF FATHERS OF UNION AND 
MANAGEMENT TRAINEES 


(N = 40 in each group) 


Union Union 
Member Member 
Group Yes No 
Union 33 7 
Management 13 27 
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The educational achievement of the trainees’ 
fathers (Table 1) significantly favors the man- 
agement group. Though it is unlikely that 
there is a very high relationship in these par- 
ticular groups between formal education and 
the intellectual or political atmosphere of the 
home, the fact that 47% of the fathers of the 
management group completed at least one 
year of college as opposed to only 7% of the 
union group’s fathers is worthy of note. 

As the present study was intended to look 
for intergroup differences, intellectual ability 
was considered as a variable to be evaluated 
in the same light as personality or develop- 
mental differences. 

Prior to the administration of the regular 
test battery, all Ss included in the study were 
given the Wonderlic Personnel Test. Consid- 
ering the marked intergroup differences in 
formal education and the preference given to 
more intelligent applicants by the Grace Line, 
the higher mean score (Table 4) of the Grace 
Line Ss is not surprising. 

In considering the Wonderlic test scores as 
a measure of intelligence it should be noted 
that the union Ss were at somewhat of a dis- 
advantage as the test, which is constructed 
along lines of increasing item difficulty, in- 
cludes a number of questions toward the end 
which would be particularly easy for indi- 
viduals who have had advanced training in 
mathematics. As the majority of the Grace 
Line trainees reported that they received 
their undergraduate training in engineering 
and business administration their observed 
superiority is perhaps not as significant as it 
initially appears. 

The observed differences in intelligence test 
scores of the experimental and control groups 
raised the possibility that intergroup differ- 
ences in personality, values or biographical 
data may be a function of intelligence rather 


TABLE 4 


WoONDERLIC PERSONNEL TEsT ScorES OF UNION 
AND MANAGEMENT TRAINEES 


Range 


14-35 
17-45 


than the union-management variable. Gough 
(1957), however, found almost no relation- 
ship between intelligence and scores on the 
California Personality Inventory; and corre- 
lation ratios computed between the Wonderlic 
scores and the Biographical Inventory cate- 
gories and the Study of Values traits disclosed 
no significant ratios, with the exception of 
social values (—.42). Though this negative 
relationship between social values and intelli- 
gence is in disagreement with other published 
studies (Pintner, 1933), the correlation was 
taken into consideration in evaluating the to- 
tal study results. 

On the basis of the ILGWU job specifica- 
tions and job duties and the general stereo- 
types of trade union and management leaders, 
15 hypotheses were formulated: 


As measured by the Sims SCI Occupational Rating 
Scale 

1. The union group identifies with a lower social 
class than does the management group. 

As measured by the Allport-Vernon Study of Values 

2. The union group reveals higher social values 
than does the management group. 

3. The union group demonstrates lower economic 
values than does the management group. 

As measured by the California Psychological Inven- 
tory 

4. The union group scores lower on self-control 
than does the management group. 

5. The union group scores higher in psycho- 
logical-mindedness than does the management 
group. 

6. The union group scores higher in responsi- 
bility than does the management group. 

7. The union group scores higher in socialization 
than does the management group. 

8. Both the union and management groups score 
high in dominance. 

9. Both the union and management groups score 
high in self-acceptance. 

10. The union group scores higher in femininity 
than does the management group. 

As measured by the Biographical Inventory 

11. The union group has had a greater number of 
oppositional relationships with authority fig- 
ures than does the management group. 

12. The union group has had a greater number of 
rejecting experiences in peer group relation- 
ships and social participation than does the 
management group. 

13. The union group, during childhood and ado- 
lescence, was supervised by a family which, in 
contrast to the management group, 

a. was maternally controlled. 
b. stressed social nonconformity. 
c. valued early independence training. 
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TABLE 5 


Bogard 


MEANS AND STANDARD DEVIATIONS OF UNION AND MANAGEMENT GROUPS 
ON EACH OF THE HypoTHEsEs TESTED 


40 in each op) 


Social class ntification 


2. Social values 46.27 
3. Economic values 37.75 
4. Self control 27.65 
5. Psychological mindedness 12.73 
6. Responsibility 29.78 
7. Socialization 33.97 
8. Dominance 33.20 
9. Self acceptance 23.72 
10. Femininity 16.75 
11. Oppositional relationships 

with authority figures 12.19 
12. Rejecting peer group 

relationships 3.98 
13. Maternal control 6.13 
14. Nonconformity 4.55 
15. Early independence training 8.77 


* Significant at less than the .05 level. 
** Significant at less than the .01 level. 


Union 


d 8. 045** 
5.96 32.00 7.08 9.754** 
7.71 45.95 5.78 5.387** 
7.52 29.25 7.03 982 
2.63 13.33 2.15 1.110 
3.99 32.65 4.77 —2.917> 
5.55 36.02 4.30 — 1.846» 
4.38 34.90 1.64 . 
3.01 24.30 8.64 
3.01 15.50 2.99 1.862* 
7.96 6.22 3.79 4.279**4 
2.82 2.03 1.59 3.800** 
3.47 5.87 3.23 ..3384 
3.23 2.70 2.25 2.932** 
2.63 6.20 2.84 4.208** 


® The instruments employed for the measurement of each trait are indicated in Footnote 4. 


b Since a one-tailed critical region gives no protection against alternatives in the direction opposite to the one hypothesized, 
it is obviously possible that such an alternative might be true and not be revealed by the presently employed one-tailed test. The 
computed ¢ score, which is less than the first percentile, would suggest that the management group can be described as scoring 
higher than the union students in these traits. 

© The hypotheses pertaining to Traits 8 and 9 did not specify intergroup differences, but hypothesized that the groups would 
score high when compared with other occupational groups. Comparison with published norms supported the hypotheses. 

8. 


On each of the tests included in the bat- 
tery, the mean and standard deviation of the 
scores of each group were computed. The sig- 
nificance of observed differences was deter- 
mined by ¢ tests employing a one-tailed criti- 
cal region. The maximum level of significance 
allowed was .05. 


RESULTS 


Generally the results (Table 5) justify the 
conclusion that the two groups are quite dif- 
ferent. The union trainees are somewhat less 
intelligent and less aggressive than the Grace 
Line group, although both groups tend to 
score above the general population in both of 
these traits. They are more altruistic than the 
management trainees, less concerned with do- 
ing things that are practical, and reveal a 
greater number of interests usually found in 
individuals who deal with people, art, music, 
and languages as opposed to those who usu- 


ally deal with objects and office records. Their 
social class identification is significantly lower 
than that of the management group. The 
ILGWU trainees, in light of the results, ap- 
pear to be less responsible, tend to manifest 
less social maturity, and reveal a long-stand- 
ing propensity to overt conflict with authority 
figures. 

During their formative years, the manage- 
ment trainees appear to have been more suc- 
cessful in their peer group relations, to have 
received less early independence training and 
to have been reared by parents who were 
generally more conforming to social pressures 
than were those of the ILGWU students, 
whose fathers received considerably less for- 
mal education and more often held union 
membership than did the fathers of the Grace 
Line trainees. 

The wide —_— between the ILGWU 
trainees’ measured altruistic feelings and their 
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reported significant lack of success in actually 
dealing with other people is striking. The 
Grace Line trainees, according to middle-class 
standards, seem to be a more emotionally 
sound group, and reflect more clearly the 
stress placed on social maturity by the un- 
ion and management program officials. The 
ILGWU Ss appear to reveal the strong so- 
cial feeling sought by the union. They also 
seem to be aggressive, assertive, and impul- 
sive, and perhaps, in meeting the entrance re- 
quirements, bring together such personality 
traits as would suggest, in an overall view, 
a lack of emotional maturity. 

At the present time, no data pertaining to 
the Ss’ relationships with each other, with 
their supervisors and with those with whom 
they must deal in the carrying out of their 
job duties is available. Accordingly, there 
are no external criteria immediately available 
against which the results of this study can be 
compared and, furthermore, any tendencies to 
answer questions according to a union or 
management stereotype, rather than to reflect 
their own true feelings, cannot be assessed. 


DIscussIOoN 


The literature has revealed a number of 
studies (Roe: 1950, 1952; Rieger, 1949; 
Daniels & Hunter, 1949) whose authors con- 
cluded that occupational personalities do 
exist, but who noted that caution must be 
exercised in putting their results to practical 
use. Individual differences within the groups 
investigated were often so great that though 
the mean trait scores significantly differed, 
the results had little operational utility. 

Within-group differences, tending to mini- 
mize the practical significance of between- 
group differences, were frequently observed in 
this study. Though the majority of the hy- 
potheses investigated produced statistically 
significant intergroup differences, the con- 
fidence with which the results could be put 
to use was somewhat impaired. Individual 
scores were frequently such that they could 
have placed Ss in either of the groups studied. 

In describing the general kind of person 
who chooses to enter trade union leadership 
through the ILGWU training institute, or the 
field of business management through the 
Grace Line training program, the results 
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‘should be valuable. The conclusions permit 
the construction of generalized typical 
ILGWU and Grace Line trainees and can 
provide some basis for individual counseling 
and for improved selection procedures. 

The findings, however, though obviously of 
particular interest to the executives of the 
ILGWU and Grace Line, provide other more 
general implications. 

It is apparent, in studies of the relationship 
between personality and occupational entry, 
that the efforts preceding and upon which in- 
strument selection is based, are of uppermost 
importance. Though ideally the surveyed 
groups should first be studied by carefully 
controlled experimental methods, it has been 
shown, where the opportunity for lengthy pre- 
study investigations is not possible, that an 
effective test battery can be selected by first 
studying the subject group by means of care- 
ful, though not necessarily experimental meth- 
ods, in which an emphasis is placed on teas- 
ing out the personality demands of the job. 
In the present investigation, instrument se- 
lection and the formulation of hypotheses fol- 
lowed discussions with executives of the two 
organizations studied, in which ideal types, 
based in part on generally desired character- 
istics and on recalled incidents “which distin- 
guished the men from the boys,” were con- 
structed: the latter technique being, in some 
way, similar to the “critical incident tech- 
nique” reported by Flanagan (1949). 

As an aid to understanding better presently 
revealed personality traits, “relevant” early 
life experiences were also explored. Biographi- 
cal inventory trait categories, referring to spe- 
cific past behavior and family structure, were 
hypothesized in order to expose such ante- 
cedent experiences as may have significantly 
contributed to presently revealed personality 
traits and vocational choice. Though a psy- 
choanalytic approach was employed in the 
postulation of particular life experiences, it 
seems obvious that regardless of the person- 
ality theory used present-day behavior can be 
more meaningfully understood when viewed 
within a developmental framework. 

In most of the traits studied, the picture of 
the two groups revealed was surprisingly close 
to the ideal types sought by the particular 

union and management program administra- 
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tors. A comparison of the general character- 
istics of the Grace Line S with prevailing ex- 
ecutive stereotypes and with the management 
trainees of another major industrial organiza- 
tion suggests that it is likely that they are in 
many ways representative of the occupation. 
It is probable, however, that the typical 
ILGWU trainee is not representative of the 
typical American trade union leader. The 
often reported increase in employment of well 
paid specialists by major American trade un- 
ions and data available in published biog- 
raphies of leaders of other unions (Minton & 
Stuart [1937]; Wechsler [1944]; Madison 
[1950]) suggests that the avowed “dedica- 
tion to trade union philosophies” and the ac- 
ceptance by the ILGWU trainees of the fact 
that their earnings will never be high are just 
two of the many areas in which they may be 
observed to be atypical. 

An empirical attempt to construct an occu- 
pational personality picture of the “trade un- 
ion leader” would entail a study of consider- 
ably greater size and scope than the present 
one. 

Significant is the consistency shown be- 
tween Ss’ early life @axperiences and present- 
day behavior, as suggested by the Study of 
Values and the California Personality Inven- 
tory. The role of societal pressures usually 
considered peculiar to certain social classes is 
also clearly demonstrated. Adding support to 
the literature suggesting a direct relationship 
between social stratification and personality, 
the management group, which on the Sims 
Occupational Rating Scale identified with a 
significantly higher social class, described 
early life experiences in which the conform- 
ity, child rearing techniques, and the stress of 
interpersonal skills experienced were consid- 
erably different from those encountered by 
the ILGWU students. 

Though scoring significantly higher than 
the union group in areas of present-day ag- 
gression, the management students, consistent 
with their suggested early training experience, 
appeared to have channelized their drives into 
more socially acceptable areas. Instances of 
overt conflict with authority are less fre- 
quently seen, and the need to do things that 
are practical is strong. 


Though both of the groups investigated 
were accepted for training only after being 
“carefully screened,” within each of their 
training groups a few individuals were dis- 
closed who seemed quite unlike their col- 
leagues. Whether the personality character- 
istics of these variants will strongly influence 
their expected success is of obvious future in- 
terest, particularly as the homogeneity of each 
group is strong and the apparent differences 
between individuals and the groups norms 
are, in some instances, striking. 

The study has added support to some gen- 
eral vocational theory and posed additional 
areas for further investigation. It has been 
demonstrated that significant personality dif- 
ferences exist between the occupational groups 
studied and that behavioral data, significantly 
differentiating two occupations and closely re- 
lated to present-day personality and values, 
can be effectively elicited through custom- 
built biographical inventories. 


SUMMARY 


In an attempt to study further the rela- 
tionship of personality to occupational entry, 
the present investigation undertook to deter- 
mine whether individuals who were consid- 
ered worthy material for training for leader- 
ship positions by the ILGWU and the Grace 
Line, Inc., were different from each other. 

Basing the selection of test instruments and 
hypotheses on job analyses and ideal types, 
which stressed personality demands of the 
job, 40 trainees in the formal training pro- 
grams of each organization were studied. In 
order to investigate what were considered to 
be the most important personality traits, the 
Sims SCI Occupational Rating Scale, the All- 
port-Vernon-Lindzey Study of Values, the 
California Psychological Inventory, and a cus- 
tom-built, trait-centered Biographical Inven- 
tory were employed and 15 hypotheses were 
formulated. 

Of the 13 hypotheses which proposed dif- 
ferences in specified directions, seven were 
significant at less than the .01 level and one 
at less than the .05 level. Two hypotheses 
which did not specify intergroup differences, 
but proposed that the groups would score 
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high when compared with other occupational 
groups, were also supported. 

Generally, the groups appeared to be com- 
posed of significantly different trainees. Indi- 
vidual differences within groups were, how- 
ever, often considerable. In addition to sup- 
porting some general vocational theory, the 
results point to the practicability in occupa- 
tional studies, when the opportunity for ex- 
perimental prestudy is not possible, of pre- 
ceding test selection with personality-factor 
job analyses, and to the discriminating ability 
of custom-built, trait-centered biographical 
inventories. 
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